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The invention features ABCl nucleic acids and polypeptides for the diagnosis and trefJ^ent of abnormal choleslerol regulation. The 
invention also features methods for identifying compounds for modulating cholesterol levels in an animal (e.g., a human). .(v 
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15 



Back ground nf rhe TnvRnrinn 
Low HDL cholesterol (HDL-C), or hypoalphalipoproteinemia, is a blood lipid 
abnormality which correlates with a high risk of cardiovascular disease (CVD), in 
particular coronary artery disease (CAD)» but also cerebrovascular disease, coronary 
2Q 10 restenosis, and peripheral vascular disease. HDL, or *good cholesterol* levels axe 

influenced by both environmental and genetic factors. 

Epidemiological studies have consistently demonstrated that plasma HDL-C 
concentration is inversely related to the incidence of CAD. HDL-C levels are a strong 
graded and independent cardiovascular risk factor. Protective effects of an elevated 
15 HDL-C persist until 80 years of age. A low HDL-C is associated with an increased CAD 
risk even with normal (<5.2 mmol/1) total plasma cholesterol levels. Coronary disease 
risk is increased by 2% in men and 3% in women for every 1 mg/dL (0.026 mmol/1) 
reduction in HDL-C and in the majority of studies this relationship is statistically 
significant even after adjustment for other lipid and non-lipid risk factors. Decreased 
20 HDL-C levels are the most common lipoprotein abnormality seen in patients with 
35 premature CAD. Four percent of patients with premature CAD with have an isolated 

form of decreased HDL-C levels with no other lipoprotein abnormalities while 25% have 
low HDL levels with accompanying hypertriglycerid^ia. 

Even in the face of other dyslipidemias or secondary factors, HDL-C levels are 

40 

25 important predictors of CAD. In a cohort of diabetics, those with isolated low HDL 
cholesterol had a 65% increased death rate compared to diabetics with normal HDL 
cholesterol levels (>0.9 mmol/1). Furthermore, it has been shown that even within high 
45 risk populations, such as those with familial hypercholesterolemia, HDL cholesterol level 

is an important predictor of CAD. Low HDL cholesterol levels thus constitute a major, 
30 independent, risk for CAD, 
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These findings have led to increased attention to HDL cholesterol levels as a 
focus for treatment, following the recommendations of the National Cholesterol 
10 Education Program. These guidelines suggest that HDL cholesterol values below 0.9 

mmol/1 confer a significant risk for men and women. As such, nearly half of patients 
5 with CAD would have low HDL cholesterol. It is therefore crucial that we obtain a better 
understanding of factors which contribute to this phenotype. In view of the fiact that 

15 

pharmacological intervention of low HDL cholesterol levels has so far proven 
unsatisfactory, it is also important to imderstand the factors that regulate these levels in 
the circulation as this understanding may reveal new therapeutic targets. 
20 10 Absolute levels of HDL cholesterol may not always predict risk of CAD. In the 

case of CETP deficiency, individuals display an increased risk of developing CAD, 
despite increased HDL cholesterol levels. What seems to be important in this case is the 
functional activity of the reverse cholesterol transport pathway, the process by which 
intracellular cholesterol is trafficked out of the cell to acceptor proteins such as ApoAI 
15 or HDL. Other important genetic detemiinants of HDL cholesterol levels, and its inverse 
relation with CAD, may reside in the processes leading to HDL formation and 
intracellular cholesterol trafScking and efflux. To date, this process is poorly understood, 
however, and clearly not all of the components of this pathway have been identified. 
Thus, defects preventing proper HDL-mediated cholesterol efflux may be important 
20 predictors of CAD. Therefore it is critical to identify and understand novel genes 
35 involved in the intracellular cholesterol trafficking and efflux pathways. 

HDL particles are central to the process of reverse cholesterol transport and 
thus to the maintenance of tissue cholesterol homeostasis. This process has multiple 
steps which include the binding of HDL to cell surface components, the acquisition of 
25 cholesterol by passive absorption, the esterification of this cholesterol by LCAT and 
the subsequent transfer of esterified cholesterol by CETP, to VLDL and chylomicron 
remnants for liver uptake. Each of these steps is known to impact the plasma 
45 concentration of HDL. 

Changes in genes for ApoAI-CIII, lipoprotein lipase, CETP, hepatic lipase, 
30 and LCAT all contribute to determination of HDL-C levels in humans. One rare form 
of genetic HDL deficiency is Tangier disease (TD), diagnosed in approximately 40 

50 
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patients world-wide, and associated with almost connplete absence of HDL cholesterol 
(HDL-C) levels (listed in OMIM as an autosomal recessive trait (OMIM 205400)), 
These patients have very low HDL cholesterol and ApoAI levels, which have been 
ascribed to hypercatabolism of nascent HDL and ApoAI, due to a delayed acquisition 
5 of lipid and resulting failure of conversion to mature HDL. TD patients accumulate 
cholesterol esters in several tissues, resulting in characteristic features, such as 

15 

enlarged yellow tonsils, hepatosplenomegaly, peripheral neuropathy, and cholesterol 
ester deposition in the rectal mucosa. Defective removal of cellular cholesterol and 
phospholipids by ApoAI as well as a marked deficiency in HDL mediated efflux of 

10 intracellular cholesterol has been demonstrated in TD fibroblasts. Even though this is 
a rare disorder, defining its molecular basis could identify pathways relevant for 
cholesterol regulation in the general population. The decreased availability of free 
cholesterol for efflux in the surface membranes of cells in Tangier Disease patients 
appears to be due to a defect in cellular lipid metabolism or trafficking. 

15 Approximately 45% of Tangier patients have signs of premature CAD, suggesting a 
strong link between decreased cholesterol efflux, low HDL cholesterol and CAD. As 
increased cholesterol is observed in the rectal mucosa of persons with TD, the 
molecular mechanism responsible for TD may also regulate cholesterol adsorption 
from the gastrointestinal (GI) tract. 

20 A more common form of genetic HDL deficiency occurs in patients who have 

35 low plasma HDL cholesterol usually below the 5th percentile for age and sex (OMIM 

10768), but an absence of clinical manifestations specific to Tangier disease (Marcil 
et aL, Arterioscler. Thromb. Vase. Biol. 19:159-169, 1999; Marcil et al., Axterioscler. 
Thromb. Vase. Biol. 15:1015-1024, 1995). These patients have no obvious 

40 

25 environmental factors associated with this lipid phenotype, and do not have severe 

hypertriglyceridemia nor have known causes of severe HDL deficiency (mutations in 
ApoAI, LCAT, or LPL deficiency) and are not diabetic. The pattern of inheritance of 
45 this condition is most consistent with a Mendelian dominant trait (OMIM 1 0768). 

The development of drugs that regulate cholesterol metabolism has so far 
30 progressed slowly. Thus, there is a need for a better understanding of the genetic 
components of the cholesterol efflux pathway. Newly-discovered components can 
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then serve as targets for drug design. 

Low HDL levels are likely to be due to multiple genetic fectors. The use of 
pharmacogenomics in the aid of designing treatment tailored to the patient makes it 
desirable to identify polymorphisms in components of the cholesterol efQux pathway. 
5 An understanding of the effect of these polymorphisms on protein function would 
allow for the design of a therapy that is optimal for the patient. 

Summary of the Invention 
In a first aspect, the invention features a substantially pure ABCl polypeptide 
10 having ABCl biological activity. Preferably, the ABCl polypeptide is human ABCl 
(e.g., one that includes amino acids 1 to 60 or amino acids 61 to 2261 of SEQ ID NO: 
1). In one preferred embodiment, the ABCl polypeptide includes amino acids 1 to 
2261 of SEQ ID NO: 1. 

Specifically excluded from the polypeptides of the invention are the 
15 polypeptide having the exact amino acid sequence as GenBank accession number 
C A Al 0005.1 and the nucleic acid having the exact sequence as AJO 123 76.1. Also 
excluded is protein having the exact amino acid sequence as GenBank accession 
number X75926. 

In a related aspect, the invention features a substantially pure ABC 1 
20 polypeptide that includes amino acids 1 to 2261 of SEQ ID NO; 1 . 

In another aspect, the invention features a substantially pure nucleic acid 
molecule encoding an ABCl polypeptide having ABCl biological activity (e.g., a 
nucleic acid molecule that includes nucleotides 75 to 254 or nucleotides 255 to 6858 
of SEQ ID NO: 2). In one preferred embodiment, the nucleic acid molecule includes 
25 nucleotides 75 to 6858 of SEQ ID NO: 2. , 
In a related aspect, the invention featnres an expression vector, a cell, or a non- 
human mammal that includes the nucleic acid molecule of the invention. 

In yet another aspect, the invention features a substantially pure nucleic acid 
molecule that includes nucleotides 75 to 254 of SEQ ID NO: 2, nucleotides 255 to 
30 685 8 of SEQ ID NO; 2, or nucleotides 75 to 6858 of SEQ ID NO: 2. 
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In still another aspect, the invention features a substantially pxire nucleic acid 
molecule that includes at least fifteen nucleotides corresponding to the 5' or 3' 
untranslated region from a human ABCl gene. Preferably, the 3' untranslated region 
includes nucleotides 7015-7860 of SEQ ID NO: 2. 
5 In a related aspect, the invention features a substantially pure nucleic acid 

molecule that hybridizes at high stringency to a probe comprising nucleotides 701 5- 
7860ofSEQIDNO:Z 

In another aspect, the invention features a method of treating a human having 
low HDL cholesterol or a cardiovascular disease, including administering to the 

10 human an ABCl polypeptide, or cholesterol-regulating fragment thereof, or a nucleic 
acid molecule encoding an ABCl polypeptide, or cholesterol-regulating fragment 
thereof. In a preferred embodiment, the human has a low HDL cholesterol level 
relative to normal. Preferably^ the ABCl polypeptide is wild-type ABCl , or has a 
mutation increases its stability or its biological activity. A preferred biological 

15 activity is regulation of cholesterol. 

In a related aspect, the invention features a me&od of preventing or treating 
cardiovascular disease, including introducing into a human an expression vector 
comprising an ABCl nucleic acid molecule operably linked to a promoter and 
encoding an ABCl polypeptide having ABCl biological activity. 

20 In another related aspect, the invention features a method of preventing or 

ameliorating the effects of a disease-causing mutation in an ABCl gene, including 
introducing into a human an expression vector comprising an ABCl nucleic acid 
molecule operably linked to a promoter and encoding an ABCl polyp^tide having 
ABCl biological activity. 

25 In still another aspect, the invention features a method of treating or 

preventing cardiovascular disease, including administering to an animal (e.g., a 
human) a compound that mimes the activity of wild-type ABCl or modulates the 
biological activity of ABCL 

One preferred cardiovascular disease that can be treated using the methods of 

30 the invention is coronary artery disease. Others include cerebrovascular disease and 
peripheral vascular disease. 
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The discovery that the ABCl gene and protein are involved in cholesterol 
transport that affects sermn HDL levels allows the ABCl protem and gene to be used 
in a variety of diagnostic tests and assays for identification of HDL-increasing or 
CVD-inhibiting drugs. In one family of such assays, the ability of domains of the 
5 ABCl protein to bind ATP is utilized; compounds that enhance this binding are 
potential HDL-increasing drugs. Similarly, the anion transport capabilities and 
membrane pore-forming functions in cell membranes can be used for drug screening. 

ABCl expression can also serve as a diagnostic tool for low HDL or CVD; 
determination of the genetic subtyping of the ABCl gene sequence can be used to 
10 subtype low HDL individuals or families to determine whether the low HDL 
phenotype is related to ABCl function. This diagnostic process can lead to the 
tailoring of drug treatments according to patient genotype (referred to as 
pharmacogenomics), including prediction of the patient*s response (e.g., increased or 
decreased efficacy or imdesired side effects upon administration of a compound or 
15 drug. 

Antibodies to an ABCl polypeptide can be used both as therapeutics and 
diagnostics. Antibodies are produced by immunologically challenging a B-cell- 
containing biological system, e.g., an animal such as a mouse, with an ABCl • 
polypeptide to stimulate production of anti-ABC 1 protein by the B-ceiis, followed by 

20 isolation of the antibody fix>m the biological system. Such antibodies can be used to 
measure ABCl polypeptide in a biological sample such as serum, by contacting the 
sample with the antibody and then measuring immune complexes as a measure of the 
ABCl polypeptide in the sample. Antibodies to ABCl can also be used as 
therapeutics for the modulation of ABCl biological activity. 

25 Thus, in another aspect, the invention features a purified antibody that 

specifically binds to ABCl. 

In yet another aspect, the invention features a method for determining whether 
a candidate compound modulates ABCl biological activity, comprising: (a) providing 
an ABCl polypeptide; (b) contacting the ABCl polypeptide with the candidate 

30 compound; and (c) measuring ABCl biological activity, wherein altered ABCl 

biological activity, relative to an ABCl polypeptide not contacted with the compound. 
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indicates that the candidate compound modulates ABCl biological activity. 
Preferably, the ABCl polypeptide is in a cell or is in a cell-free assay system. 

In still another- aspect, the invention features a method for determining 
whether a candidate compound modulates ABCl expression. The method includes 
5 (a) providing a nucleic acid molecule comprising an ABCl promoter operably linked 
to a reporter gene; (b) contacting the nucleic acid molecule with the candidate 
compoimd; and (c) measuring reporter gene expression^ wherein altered reporter gene 
expression, relative to a nucleic acid molecule not contacted with the compound, 
indicates that the candidate compound modulates ABCl expression. 

10 In another aspect, the invention features a method for determining whether 

candidate compound is useful for modulating cholesterol levels, the method including 
the steps of: (a) providing an ABCl polypeptide; (b) contacting the polypeptide with 
the candidate compoimd; and (c) measuring binding of the ABCl polypeptide, 
wherein binding of the ABCl polypeptide indicates that the candidate compound is 

15 useful for modulating cholesterol levels. 

In a related aspect, the invention features method for determining whether a 
candidate compound mimics ABCl biological activity. The method includes (a) 
providing a cell that is not expressing an ABCl polypeptide; (b) contacting the cell 
with the candidate compound; and (o) measuring ABCl biological activity of the cell, 

20 wherein altered ABCl biological activity, relative to a cell not contacted with the 
compound, indicates that the candidate compound modulates ABCl biological 
activity. Preferably, the cell has an ABCl null mutation. In one preferred 
embodiment, the cell is in a mouse or a chicken (e.g., a WHAM chicken) in which its 
ABC J gene has been mutated. 

25 In still another aspect, the invention features a method for determining 

whether a candidate compound is useful for the treatment of low HDL cholesteroL 
The method includes (a) providing an ABC transporter (e.g., ABCl); (b) contacting 
the transporter with the candidate compound; and (c) measuring ABC transporter 
biological activity, wherein increased ABC transporter biologica] activity, relative to a 

30 transporter not contacted with the compound, indicates that the candidate compound 
is useful for the treatment of low HDL cholesterol. Preferably the ABC transporter is 

-7- 
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in a cell or a cell free assay system. 

In yet another aspect, the invention features a method for deteniiining whether 
^0 candidate compound is useful for modulating cholesterol levels* The method includes 

(a) providing a nucleic acid molecule comprising an ABC transporter promoter 
5 operabiy linked to a reporter gene; (b) contacting the nucleic acid molecule with the 
candidate compound; and (c) measuring expression of the reporter gene, wherein 
increased expression of the reporter gene, relative to a nucleic acid molecule not 
contacted with the compound, indicates that the candidate compound is useful for 
modulating cholesterol levels. 
10 In still another aspect, the invention features a method for determining 

whether a candidate compound increases the stability or decreases the regulated 
cataboHsm of an ABC transporter polypeptide. The method includes (a) providing an 
ABC transporter polypeptide; (b) contacting the transporter with the candidate 
25 compound; and (c) measuring the half-life of the ABC transporter polypeptide, 

15 wherein an increase in the half-life, relative to a transporter not contacted with the 

compound, indicates that the candidate compound increases the stability or decreases 
the regulated catabolism of an ABC transporter polypeptide. Preferably the ABC 
transporter is in a cell or a cell free assay system. 

In a preferred embodiment of the screening methods of the present invention, 
20 the cell is in an animal. The preferred ABC transporters are ABCl, ABC2, ABCR, 
35 and ABC8, and the preferred biological activity is transport of cholesterol (e.g., HDL 

cholesterol or LDL cholesterol) or interleukin-1 , or is binding or hydrolysis of ATP 
by the ABCl polypeptide. 

Preferably, the ABCl polypeptide used in the screening methods includes 

40 

25 amino acids 1-60 of SEQ ID NO; 1 . Alternatively, the ABC 1 polypeptide can include 

a region encoded by a nucleotide sequence that hybridizes under high stringency 

conditions to nucleotides 75 to 254 of SEQ ID NO: 2. 
45 In another aspect, the invention features a method for determining whether a 

patient has an increased risk for cardiovascular disease. The method includes 
30 determining whether an ABCl gene of the patient has a mutation, wherein a mutation 

indicates that the patient has an increased risk for cardiovascular disease. 
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In related aspect, the invention features a method for determining whether a 
patient has an increased risk for cardiovascular disease. The method includes 
determining whether an ABCl gene of the patient has a polymorphism, wherein a 
polymorphism indicates that the patient has an increased risk for cardiovascular 
5 disease. 

In another aspect, the invention features a method for determining whether a 
patient has an increased risk for cardiovascular disease. The method includes 
measuring ABCl biological activity in the patient, wherein increased or decreased 
levels in the ABCl biological activity, relative to normal levels, indicates that the 

10 patient has an increased risk for cardiovascular disease. 

In still another aspect, the invention features a method for determining 
whether a patient has an increased risk for cardiovascular disease. The method 
includes measuring ABC 1 expression in the patient, wherein decreased levels in the 
ABCl expression relative to normal levels, indicates that the patient has an increased 

15 risk for cardiovascular disease. Preferably, the ABCl expression is determined by 
measuring levels of ABCl polyp^tide or ^Ci RNA. 

In another aspect, the invention features a non-human mammal having a 
transgene comprising a nucleic acid molecule encoding a mutated ABCl polypeptide. 
In one embodiment, the mutation is a dominant-negative mutation. 

20 In a related aspect, the invention features a non^human mammal, having a 

transgene that includes a nucleic acid molecule encoding an ABCl polypeptide 
having ABCl biological activity. 

In another related aspect, the invention features a cell firom a non-human 
mammal having a transgene that includes a nucleic acid molecule encoding an ABCl 

25 polypeptidehaving ABCl biological activity. 

In still another aspect, the invention features a method for determining 
whether a candidate compound decreases the inhibition of a dominant-negative ABCl 
polypeptide. The method includes (a) providing a cell expressing a dominant- 
negative ABCl polypeptide; (b) contacting the cell with the candidate compound; and 

30 (c) measuring ABCl biological activity of the cell, wherein an increase in the ABCl 
biological activity, relative to a cell not contacted with the compound, indicates that 
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the candidate compound decreases the inhibition of a dominant-negative ABCl 
polypeptide. 

By '^polypeptide" is meant any chain of more than two amino acids, regardless 
of post-translational modification such as glycosylation or phosphorylation. 
5 By ^'substantially identical" is meant a polypeptide or nucleic acid exhibiting 

at least 50%, preferably 85%, more preferably 90%, and most preferably 95% identity 
to a reference amino acid or nucleic acid sequence. For polypeptides, the length of 
comparison sequences will generally be at least 16 amino acids, preferably at least 
20 amino acids, more preferably at least 25 amino acids, and most preferably 

10 35 amino acids. For nucleic acids, the length of comparison sequences will generally 
be at least 50 nucleotides, preferably at least 60 nucleotides, more preferably at least 
75 nucleotides, and most preferably 110 nucleotides. 

Sequence identity is typically mej^ured using sequence analysis software with 
the default parameters specified therein (e.g.. Sequence Analysis Software Package of 

15 the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 
University Avenue, Madison, WI 53705). This software program matches similar 
sequences by assigning degrees of homology to various substitutions, deletions, and 
other modifications. Conservative substitutions typically include substitutions within 
the following groups: glycine, alanine, valine, isoleucine, leucine; aspartic acid, 

20 glutamic acid, asparagine, glutamxne; serine, threonine; lysine, arginine; and 
phenylalanine, tyrosine. 

By '*high stringency conditions" is meant hybridization in 2X SSC at 40 ""C 
with a DNA probe length of at least 40 nucleotides. For other definitions of high 
stringency conditions, see F. Ausubel et al.. Current Protocols in Molecular Biology, 

25 pp. 6,3.1-6.3.6, John Wiley & Sons, New York, NY, 1994, hereby incorporated by 
reference. 

By **substantially pure polypeptide" is meant a polypeptide that has been 
separated fi^om the components that naturally accompany it. Typically, the 
polypeptide is substantially pure when it is at least 60%, by weight, free from the 
30 proteins and naturally-occurring organic molecules with which it is naturally 

associated. Preferably, the polypeptide is an ABCl polypeptide that is at least 75%, 
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more preferably at least 90%, and most preferably at least 99%, by weight, pure. A 
substantially pure ABCl polypeptide may be obtained, for example, by extraction 
from a natural source (e.g., a pancreatic cell), by expression of a recombinant nucleic 
acid encoding a ABCl polypeptide, or by chemically synthesizing the protein. Purity 
5 can be measured by any appropriate method, e.g., by column chromatography, 
polyacrylamide gel electrophoresis, or HPLC analysis. 

A polypeptide is substantially free of naturally associated components when it 
is separated from those contaminants that accompany it in its natural state. Thus, a 
polypeptide which is chemically synthesized or produced in a cellular system different 

10 from the cell from which it naturally originates will be substantially free from its 
naturally associated components. Accordingly, substantially pure polypeptides 
include those which naturally occur in eukaryotic organisms but are synthesized in E, 
coli or other prokaryotes. 

By '^substantially pure nucleic acid" is meant nucleic acid that is free of the 

15 genes which, in the naturally-occurring genome of the organism from which the 
nucleic acid of the invention is derived, flank the nucleic acid. The term therefore 
includes, for example, a recombinant nucleic acid that is incorporated into a vector; 
into an autonomously replicating plasmid or virus; into the genomic nucleic acid of a 
prokaryote or a eukaryote cell; or that exists as a separate molecule (e.g., a cDNA or a 

20 genomic or cDNA fragment produced by PGR or restriction endonuclease digestion) 
independent of other sequences. It also includes a recombinant nucleic acid that is 
part of a hybrid gene encoding additional polypeptide sequence. 

By **modulates" is meant increase or decrease. Preferably, a compound that 
modulates cholesterol levels (e.g., HDL-cholesterol levels, LDL-cholesterol levels, or 

25 total cholesterol levels), or ABCl biological activity, expression, stability, or 
degradation does so by at least 10%, more preferably by at least 25%, and most 
preferably by at least 50%. 

By *'purified antibody*' is meant antibody which is at least 60%, by weight, 
free from proteins and naturally occiming organic molecules with which it is naturally 

30 associated. Preferably, the preparation is at least 75%, more preferably 90%, and 
most preferably at least 99%, by weight, antibody. A purified antibody may be 
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obtained, for example, by affinity chromatography using recombinantly-produced 
protein or conserved motif peptides and standard techniques. 

By "specifically binds'* is meant an antibody that recognizes and binds to, for 
example, a human ABCl polypeptide but does not substantially recognize and bind to 
5 other non-ABCl molecules in a sample, e.g., a biological sample, that naturally 

includes protein. A preferred saitibody binds to the ABCl polypeptide sequence of 
Fig.9A(SEQIDNO: 1). 

By **polymorphism** is meant that a nucleotide or nncleotide region is 
characterized as occurring in several different forms. A "mutation" is a form of a 
10 polymorphism in which the expression level, stability, function, or biological activity 
of the encoded protein is substantially altered. 

By "ABC transporter** or "ABC polypeptide** is meant any transporter that 
hydrolyzes ATP and transports a substance across a membrane. Preferably, an ABC 
transporter polypeptide includes an ATP Binding Cassette and a transmembrane 
15 region. Examples of ABC transporters include, but are not limited to, ABCl, ABC2, 
ABCR,andABC8, 

By "ABCl polypeptide" is meant a polypeptide having substantial identity to 
an ABCl polypeptide having the amino acid sequence of SEQ ID NO: 1. 

By "ABC biological activity" or "ABCl biological activity** is meant 

20 hydrolysis or binding of ATP, transport of a compoimd (e.g., cholesterol, interleukin- 
1) or ion across a membrane, or regulation of cholesterol or phospholipid levels (e.g., 
either by increasing or decreasing HDL-cholesterol or LDL-cholesterol levels). 

The invention provides screening procedures for identifying ther^e^tic 
compounds (cholesterol-modulating or anti-CVD pharmaceuticals) which can be used 

25 in human patients. Compounds that modulate ABC biological activity (e.g., ABCl 
biological activity) are considered useful in the invention, as are compounds that 
modulate ABC concentration, protein stability, regulated catabolism, or its ability to 
bind other proteins or factors. In general, the screening methods of the invention 
involve screening any number of compounds for therapeutically active agents by 

30 employing any number of in vitro or in vivo experimental systems. Exemplary 
methods useful for the identification of such compounds are detailed below. 
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The methods of the invention simplify the evaluation, identification and 
development of active agents for the treatment and prevention of low HDL and CVD* 
In general, the screening methods provide a £icile means for selecting natural product 
extracts or compounds of interest from a large population which are further evaluated 
5 and condensed to a few active and selective materials. Constitutes of this pool are 
then purified and evaluated in the methods of the invention to determine their HDL- 
raising or anti-CVD activities or both. 

Other features and advantages of the invention will be apparent from the 
following description of the preferred embodim^ts thereof, and from the claims. 

10 

Brijef Description of the Drawings 
Figs. 1 A and IB are schematic illustrations showing two pedigrees with 
Tangier Disease, (TD-1 and TD-2). Square and circle symbols represent males and 
females, respectively. Diagonal lines are placed through the symbols of all deceased 

15 individuals, A shaded symbol on both alleles indicates the probands with Tangier 
Disease, Individuals with half shaded symbols have HDL-C levels at or below the 
10th percentile for age and sex, while those with quarter shaded symbols have HDL-C 
between the 1 1th and 20th percentiles. 

Each individual's ID number, age at the time of lipid measurement, 

20 triglyceride level and HDL cholesterol level followed by their percentile ranking for 
age and sex are listed below the pedigree symbol. Markers spanning the 9q31.1 
region are displayed to the left of the pedigree. The affected allele is represented by 
the darkened bars which illustrate the mapping of the limits of the shared haplotype 
region as seen in Fig. 3. Parentheses connote inferred marker data, questions marks 

25 indicate unknown genotypes, and large arrows show the probands. 

Fig. IC shows ApoAI (10 ^.g/mL) -mediated cellular cholesterol efflux in 
control fibroblasts (n-5, normalized to 100%) and two subjects with Tangier disease 
(TD). Cells were ^H-cholesterol (0.2 fiCi/mL) labeled during growth and cholesterol 
(20 ^g/mL) loaded in growth arrest. Cholesterol efflux is determined as 

30 mediuni/(^H cell + ^H medium) 
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Fig. 2 A - 2D are schematic illustrations showing ibur French Canadian 
pedigrees with FHA (FHA-1 to -4). The notations are as in Fig. 1. Exclamation points 
on either side of a genotype (as noted in Families FHA-3 and FHA-4) are used when 
the marker data appears to be inconsistent due to potential microsatellite repeat 
5 expansions. A bar that becomes a single thin line suggests that the haplotype is 
indeterminate at that marker. 

Figs. 3A - 3E are a schematic illustration showing a genetic and physical map 
of 9q3 1 spanning 35 cM, Fig. 3A: YACs from the region of 9q22-34 were identified 
and a YAC contig spanning this region was constructed. Fig. 3B; A total of 22 

10 polymorphic CA microsatellite markers were mapped to the contig and used in 

haplotype analysis in TD-1 and TD-2. Fig 3C: The mutant haplotypes for probands in 
TD-1 and -2 indicate a significant region of homozygosity in TD-2, while the 
proband in TD-1 has 2 different mutant haplotypes. The candidate region can be 
narrowed to the region of homozygosity for CA markers in proband 2. A critical 

15 crossover at D9S1690 in TD-1 (A)* also provides a centromeric boundary for the 

region containing the gene. Three candidate genes in this region {ABC I, LPA-R and 
RGS-3) are shown. Fig, 3D: Meiotic recombinations in the FHA families (A-H) 
refine the minimal critical region to 1,2 cM between D9S277 and D9S1866. The 
heterozygosity of the TD-2 proband at D9S127, which ends a continuous region of 

20 homozygosity in TD-2, fiuther refines the region to less than 1 cM. This is the region 
to which ABCl has been mapped. Fig. 3E: Isolated YAC DMA and selected markers 
from the region were used to probe highrdcnsity BAC grid filters, selecting BACs 
which via STS-content mapping produced an 800 Kb contig. Four BACs containing 
ABC J were sequenced using high-throughput methods. 

25 Fig. 4 A shows sequence of one mutation in family TD-L Patient III-Ol is 

heterozygoxis for a T to C transition at nucleotide 4503 of the cDNA; the control is 
homozygous for T at this position. This mutation corresponds to a cysteine to 
arginine substitution in the ABCl protein (C1477R). 

Fig 4B shows the amino acid sequence conservation of residue 1477 in mouse 

30 and human, but not a related C elegans gene. A change from cysteine to arginine 

likely has an important effect on the protein secondary and tertiary structure, as noted 
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by its negative scores in most substitution matrices (Schuler et al., A Practical Guide 
to the Analysis of Genes and Proteins, eds. Baxevanis, A.D. & Ouellette, B,F.F. 
1 45 : 1 7 1 , 1 998). The DNA sequences of the normal and mutant genes are shown 
above and below the amino acid sequences, respectively. 
5 Fig. 4C shows the segregation of the T4503C mutation in TD-1. The presence 

of the T4503C mutation (+) was assayed by restriction enzyme digestion with Hgal^ 
which cuts only the mutant (C) allele (i ). Thus, in the absence of the mutation, only 
the 194 bp PGR product (amplified between and -) is observed, while in its 
presence the PGR product is cleaved into j&agments of 1 34 bp and 60 bp. The 

10 proband (individual III.0 1 ) was observed to be heterozygous for this mutation (as 
indicated by both the 194 bp and 134 bp bands), as were his daughter, father, and 
three paternal cousins. A fourth cousin and three of the father's siblings were not 
carriers of this mutation. 

Fig. 4D shows Northern blot analysis with probes spanning the complete 

15 ABCl gene reveal the expected ~8 Kb transcript and, in addition, a -3.5 kb truncated 
transcript only seen in the proband TD-1 and not in TD-2 or control. This was 
detected by probes spanning exons 1-49 (a), 1-41 (b), 1-22 (c), and 23-29 (d), but not 
with probes spanning exons 30-41 (e) or 42-49 (f). 

Fig. 5 A shows the sequence of the mutation in family TD-2. Patient IV- 10 is 

20 homozygous for an A to G transition at nucleotide 1 864 of the cDNA (SEQ ID NO: 
2); the control is homozygous for A at this position. This mutation corresponds to a 
glutamine to arginine substitution in the ABCl protein (Q597R). 

Fig. 5B shows that the glutamine amino acid, which is mutated in the TD-2 
proband, is conserved in human and mouse ABCl as well as in an ABC orthologue 

25 from C elegans, revealing the specific importance of this residue in the 

structure/function of this ABC protein in both worms and mammals. The DNA 
sequences of the normal and mutant proteins are shown above and below the amino 
acid sequences, respectively. 

Fig. 5C shows the segregation of the A1864G mutation in TD-2. The 

30 presence of the A1864G mutation (indicated by +) was assayed by restriction enzyme 
digestion with AciL The 360 bp PCR product has one invariant Acil recognition site 
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(I), and a second one is created by the A1864G mutation. The wild-type allele is thus 
cleaved to firagments of 215 bp and 145 bp, while the mutant allele (G-allele) is 
cleaved to firagments of 1S5 bp, 145 bp and 30 bp. The proband (individual IV-IO), 
the product of a consanguineous mating, was homozygous for the A1864G mutation 
5 (+/+), as evidenced by the presence of only the 185 bp and 145 bp bands, while four 
other family members for whom DNA was tested are heterozygous carriers of this 
mutation (both the 215 bp and 185 bp fragments were present). Two unaffected 
individuals (V-), with only the 215 bp and 145 bp bands are shown for comparison. 
Fig. 6A shows a sequence of the mutation in family FHA-1 , Patient IH-Ol is 
10 heterozygous for a deletion of nucleotides 215 1-2153 of the cDNA (SEQ ID NO: 2). 
This deletion was detected as a superimposed sequence starting at the first nucleotide 
after the deletion. This coiresponds to deletion of leucine 693 in the ABCl protein 
(SEQ ID NO: 1). 

Fig. 6B is an alignment of the human and mouse wild-type amino acid 
15 sequences, showing that the human and mouse sequences are identical in the vicinity 
of AL693. L693 is also conserved in C. elegans. This highly conserved residue lies 
within a predicted transmembrane domain. The DNA sequences of the normal and 
mutant proteins are shown above and below the amino acid sequences, respectively. 
Fig. 6C shows segregation of the AL693 mutation in FHA-1, as assayed by 
20 Earl restriction digestion. Two invariant Earl restriction sites (indicated by I) are 
present within the 297 bp PGR product located between the horizontal arrows (--) 
while a third site is present in the wild-type allele only. The presence of the mutant 
allele is thus distinguished by the presence of a 210 bp fragment (+), while the normal 
allele produces a 151 bp fragment (*). The proband of this family (in.Ol) is 
25 heterozygous for this mutation, as indicated by the presence of bpth the 2 1 0 and 151 
bp bands. 

Fig. 6D shows a sequence of the mutation in family FHA-3. Patient III-O l is 
heterozygous for a deletion of nucleotides 5752-5757 of the cDNA (SEQ ID NO: 2). 
This deletion was detected as a superimposed sequence starting at the first nucleotide 
30 after the deletion. This corresponds to deletion of glutamic acid 1893 and aspartic 
acid 1 894 in the ABCl protein (SEQ ID NO: 1 ). 
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Fig. 6E is aa aligmnent of the human and mouse wild-type amino acid 
sequences, showing that the human and mouse sequences are identical in the vicinity 
of A5752-5757. This region is highly conserved in C elegans. The DNA sequences 
of the normal and mutant proteins are shown above and below the amino acid 
5 sequences, respectively. 

Fig. 6F shows a sequence of the mutation in family FHA-2. Patient ni-Ol is 
heterozygous for a for a C to T transition at nucleotide 6504 of the cDNA (SEQ ID 
NO: 2). This alteration converts an arginine at position 2144 of SEQ ID NO: 1 to a 
STOP codon, causing truncation of the last 118 amino acids of the ABCl protein. 

10 Figs. 7A and 7B show cholesterol efflux from human skin fibroblasts treated 

with ABCl antisense oligonucleotides. Fibroblasts from a control subject were 
labeled with cholesterol (0.2 piCi/mL) during growth for 48 hours and transfected 
with 500 nM ABCl antisense AN-1 (5'-GCA GAG GGC ATG GCT TTA TTT G-3*; 
SEQ ID NO: 3) with 7.5 ^g lipofectin for 4 hours. Following transfection, cells were 

15 cholesterol loaded (20 |ig/mL) for 12 hours and allowed to equilibrate for 6 hours. 

Cells were either then harvested for total RNA and 10 was used for Northern blot 
analysis. Cholesterol efflux experiments were carried out as described herein. Fig. 
7A; AN-1 was the oligonucleotide that resulted in a predictable decrease in ABCl 
RNA transcript levels. Fig. 7B: A double antisense transfection method was used. In 

20 this method, cells were labeled and transfected with AN-1 as above, allowed to 

recover for 20 hours, cholesterol loaded for 24 hours, and tihen re-transfected with 
AN-1 . Twenty hours after the second transfection, the cholesterol efflux as measured. 
A -50% decrease in ABCl transcript levels was associated with a significant decrease 
in cholesterol efflux intermediate between that seen in wild-type and TD fibroblasts. 

25 Fig. 7C shows show cholesterol efflux fi*om human skin fibroblasts treated 

with antisense oligonucleotides directed to the region encoding the amino-terminal 60 
amino acids. Note that the antisense oligonucleotide AN-6, which is directed to the 
previously imrecognized translation start site, produces a substantial decrease in 
cellular cholesterol efflux. 

30 Fig. 8 is a schematic illustration showing predicted topology, mutations, and 

polymorphisms of ABCl in Tangier disease and FHA. The two transmembrane and 
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ATP binding domains are indicated. The locations of mutations are indicated by the 
arrows with the amino acid changes, which are predicted from the human ABC J 
cDNA sequence. These mutations occur in different regions of the ABCl protein. 

Fig, 9A shows the amino acid sequence of the human ABCl protein (SEQ ID 
5 NO: 1). 

Figs. 9B - 9E show the nucleotide sequence of the human ABCl cDNA (SEQ 
ID NO: 2). 

Fig. 10 shows the 5* and 3* nucleotide sequences suitable for use as 5* and 3' 
PGR primers, respectively, for the amplification of the indicated ABCl exon. 

10 Fig. 1 1 shows a summary of alterations found in ABC 1 , including sequencing 

errors, mutations, and polymorphisms. 

Fig. 12 shows a series of genomic contigs (SEQ ID NOS. 14-29) containing 
the ABCl promoter (SEQ ID NO; 14), as well as exons 1-49 (and flanking intronic 
sequence) of ABCl . The exons (capitalized letters) are found in the contigs as 

15 follows: SEQ ID NO: 14-exon 1; SEQ ID NO: 15-exon 2; SEQ ID NO: 16-exon 3; 
SEQ ID NO: IV-exon 4; SEQ ID NO: 18-exon 5; SEQ ID NO: 19--exon 6; SEQ ID 
NO: 20«exons 7 and 8; SEQ ID NO: 21--exons 9 through 22; SEQ ID NO: 22--exons 
23 through 28; SEQ ID NO: 23-exon 29; SEQ ID NO: 24-exons 30 and 3 1 ; SEQ ID 
NO: 25-exon 32; SEQ ID NO: 26-exons 33 through 36; SEQ ID NO: 27-exons 37 

20 through 41 ; SEQ ID NO: 28-exons 42-45; SEQ ID NO: 29-exons 46-49. 

Fig. 13 is a series of illustrations showing that the amino-terminal 60 amino 
acid region of ABCl is protein-coding. Lysates of normal human fibroblasts were 
immunoblotted in parallel with a rabbit polyclonal antibody to amino acids 1-20 of 
human ABC 1 ( 1 ); a rabbit polyclonal antibody to amino acids 1 430- 1 449 of human 

25 ABCl (2); and a mouse monoclonal antibody to amino acids 2236-2259 of human 

ABCV. The additional bands detected in lane 2 may be due to a lack of specificity of 
that antibody or the presence of degradation products of ABCl . 

Fig. 14 is a schematic illustration showing that the WHAM chicken contains a 
non-conservative substitution (G265A) resulting in an amino acid change (E89K). 

30 Fig. 15 is a schematic illustration showing that the mutation in the WHAM 

chicken is at an amino acid that is conserved among human, mouse, and chicken. 
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Fig. 16 show a summary of locations of consensus transcription factor binding 
sites in the human ABCl promoter (nucleotides 1-8238 of SEQ ID NO: 14). The 
abbreviations are as follows: PPRE^eroxisome proliferator-activated receptor. 
SRE-steroid response element-binding protein site. ROR=RAR-related orphan 
5 receptor. 



Detailed Description 
Genes play a significant role influencing HDL levels, Tangier disease (TD) 

10 was the first reported genetic HDL deficiency. The molecular basis for TD is 

unknown, but has been mapped to 9q3 1 in three families. We have identified two 
additional probands and their families, and confirmed linkage and refined the locus to 
a limited genomic region. Mutations in the ABCJ gene accounting for all four alleles 
in these two families were detected* A more frequent cause of low HDL levels is a 

15 distinct disorder, familial HDL deficiency (FHA). On the basis of independent 

linkage, meiotic recombinants and disease associated haplotypes, FHA was localized 
to a small genomic region encompassing the ABC I gene. A mutation in a conserved 
residue in ABCl segregated with FHA. Antisense reduction of the ABCl transcript in 
fibroblasts was associated with a significant decrease in cholesterol efflux, 

20 Cholesterol is normally assembled with intracellular lipids and secreted, but in 

TD the process is diverted and cholesterol is degraded in lysosomes. This disturbance 
in intracellular trafficking of cholesterol results in an increase in intracellular 
cholesterol ester accumulation associated with morphological changes of lysosomes 
and the Golgi apparatus and cholesteryl ester storage in histiocytes, Schwann cells, 

25 smooth muscle cells, mast cells and fibroblasts. 

The clinical and biochemical heterogeneity in patients with TD has led to the 
possibility that genetic heterogeneity may also underlie this disorder. Considering 
this, we initially performed linkage analysis on these two families of different 
ancestries (TD-l is Dutch, TD-2 is British; Frohlich et al., Clin. Invest. Med. 10:377- 

30 382, 1987) and confirmed that the genetic mutations underlying TD in these families 
were localized to the same 9q3 1 region, to which a large family with TD had been 
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assigned (Rust et al.. Nature Genetics 20:96-98, 1998). Detailed haplotype analysis, 
together with the construction of a physical map, refined the localization of this gene. 
Mutations in the ABC 1 gene were found in TD. 

FHA is much more common than TD, although its precise frequency is not 
5 known. While TD has been described to date in only 40 families, we have identified 
more than 40 FHA families in the Netherlands and Quebec alone. After initial 
suggestions of linkage to 9q31, thirteen polymorphic markers spanning approximately 
10 cM in this region were typed and demonstrated the highest LOD score at D9S277. 
Analysis of the homozygosity of markers in the TD-2 proband, who was expected to 

10 be homozygous for markers close to TD due to his parents' consanguinity, placed the 
TD gene distal to D95 127. Combined genetic data from TD and FHA families 
pointed to the same genomic segment spanning approximately 1,000 kb between 
D9Sl27andD9S1866, ThQABCl transporter gene was contained within the minimal 
genomic region. RT-PCR analysis in one family demonstrated a deletion of leucine at 

15 residue 693 (A693) in the first transmembrane domain of ABCl, which segregated 
with the phenotype of HDL deficiency in this family. 

ABCl is part of the ATP-binding cassette (ABC transporter) superfamily, 
which is involved in energy-dependent transport of a wide variety of substrates across 
membranes (Dean et aL, Curr, Opin. Gen. Dev. 5:779-785, 1995). These proteins 

20 have characteristic motifs conserved throughout evolution which distinguish this class 
of proteins from other ATP binding proteins. In humans these genes essentially 
encode two ATP binding segments and two transmembrane domains (Dean et al., 
Curr. Opin. Gen. Dev. 5:779-785, 1995). We have now shown that the ABCl 
transporter is crucial for intracellular cholesterol transport. 

25 We have demonstrated that reduction of the ABCJ transcript using 

oligonucleotide antisense approaches results in decreased efflux, clearly 
demonstrating the link between alterations in this gene and its functional effects. TD 
and FHA now join the growing list of genetic diseases due to defects in the ABC 
group of proteins including cystic fibrosis (Zielenski, et al., Annu. Rev, Genet. 

30 29:777-807, 1995), adrenoleukodystrophy (Mosser et al.. Nature 361 : 726-730, 1993), 
Zellweger syndrome (Gartner et al., Nat. Genet. 1 :23, 1 992), progressive familial 
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intrahepatic cholestatis (Bull et al., Nat. Genet. 18:219-224, 1998), and different eye 
disorders including Stargardt disease (Allikmets et al, Nat, Genet* 15:236-246, 1997), 
autosomal recessive retinitis pigmentosa (Allikmets et aL, Science 277:1805-1807, 
1997), and cone-rod dystrophy (Cremers et al.. Hum. Mol. Genet. 7:355-362, 1998). 
5 Patients with TD have been distinguished from patients with FHA on the basis 

that Tangier disease was an autosomal recessive disorder (OMIM 20540) while FHA 
is inherited as an autosomal dominant trait (OMIM 10768). Furthermore, patients 
with TD have obvious evidence for intracellular cholesterol accumulation which is not 
seen in FHA patients. It is now evident that heterozygotes for TD do have reduced 
10 HDL levels and that the same mechanisms xmderlie the HDL deficiency and 

cholesterol efflux defects seen in heterozygotes for TD as well as FHA. Furthermore, 
the more severe phenotype in TD represents loss of function from both alleles of the 
ABCJ gene. 

ABCl is activated by protein kinases, presumably via phosphorylation, which 

15 also provides one explanation for the essential role of activation of protein kinase C in 
promoting cholesterol efflux (Drobnick et al., Arterioscler. Thromb. Vase. Biol. 15: 
1369-1377, 1995). Brefeldin, which inhibits trafficking between the endoplasmic 
reticulum and the Golgi, significantly inhibits cholesterol efflux, essentially 
reproducing the effect of mutations in ABCl, presumably through the inhibition of 

20 ABCl biological activity* This finding has significance for the vmderstanding of 
mechanisms leading to premature atherosclerosis. TD homozygotes develop 
premature coronary artery disease, as seen in the proband of TD-1 (III-Ol ) who had 
evidence for coronary artery disease at 38 years. This is particular noteworthy as TD 
patients, in addition to exhibiting significantly reduced HDL, also have low LDL 

25 cholesterol, and yet they develop atherosclerosis despite this. This highlights the 
importance of HDL intracellular transport as an important mechanism in 
atherogenesis. There is significant evidence that heterozygotes for TD are also at 
increased risk for premature vascular disease (Schaefer et al., Ann. Int Med. 93:261- 
266, 1980; Serfaty-Lacrosniere et al., Atherosclerosis 107:85-98, 1994), There is also 

30 preliminary evidence for premature atherosclerosis in some probands with FHA (Fig. 

2B), e.g., the proband in FHA-2 (III-Ol) had a coronary artery bypass graft at 46 years 
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while the proband in FHA-3 (Fig. 2C) had evidence for GAD around 50 years of age. 
The TD-1 proband had more severe efflux deficiency than the TD-2 proband (Fig. 
1 C). Interestingly> the TD-2 proband had no evidence for CAD by 62 when he died 
of unrelated causes, providing preliminary evidence for a relationship between the 
5 degree of cholesterol efflux (mediated in part by the nature of the mutation) and the 
likelihood of atherosclerosis. 

The ABCl gene plays a crucial role in cholesterol transport and, in particular, 
intracellular cholesterol trafficking in monocytes and fibroblasts. It also appears to 
play a significant role in other tissues such as the nervous system, GI tract, and the 
10 cornea. Completely defective intracellular cholesterol transport results in peripheral 
neuropathy, corneal opacities, and deposition of cholesterol esters in the rectal 
mucosa. 

HDL deficiency is heterogeneous in nature. The delineation of the genetic 
basis of TD and FHA underlies the importance of this particular pathway in 

15 intracellular cholesterol transport, and its role in the pathogenesis of atherosclerosis. 
Unravehng of the molecular basis for TD and FHA defines a key step in a poorly 
defined pathway of cholesterol efflux from cells and could lead to new approaches to 
treatment of patients with HDL deficiency in the general population. 

HDL has been implicated in numerous other biological processes, including 

20 but not limited to: prevention of lipoprotein oxidation; absorption of endotoxins; 

protection against Trypanosoma brucei infection; modulation of endothelial cells; and 
prevention of platelet aggregation (seeGenest et al., J. Invest, Med. 47: 31-42, 1999, 
hereby incorporated by reference). Any compound that modulates HDL levels may 
be useful in modulating one or more of the foregoing processes. The present 

25 discovery that ABCl functions to regulate HDL levels links, for the first time, ABCl 
with the foregoing processes. 

The following examples are to illustrate the invention. They are not meant to 
limit the invention in any way* 
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Anal ysis of TP FamiHfts 

Studies of cholesterol efflux 

Both probands had e\ddence of marked deficiency of cholesterol efflux similar 
to that previously demonstrated in TD patients (Fig. IC). TD-1 is of Dutch descent 
5 while TD-2 is of British descent. 



Linkage analysis and establishment of a physical map 

Multiple DNA markers were genotyped in the region of 9q3 1 to which linkage 
to TD had been described (Rust et al., Nat. Genet. 20, 96-98, 1998). Two point 

10 linkage analysis gave a maximal peak LOD score of 6.49 at D9S 1 832 

(Table 1) with significant evidence of linkage to all markers in a --10 cM interval. 
Recombination with the most proximal marker* D9S1690 was seen in 11-09 in Family 
TD-1 (A* in Fig. 3D) providing a centromeric boundary for the disease gene. 
Multipoint linkage analysis of these data did not increase the precision of the 

15 positioning of the disease trait locus, 

A physical map spanning approximately 10 cM in this region was established 
with the development of a YAC contig (Fig. 3 A). In addition, 22 other polymorphic 
multi-allelic markers which spanned this particular region were mapped to the contig 
(Fig. 3B) and a subset of these were used in construction of a haplotype for further 

20 analysis (Figs. lA and IB; Table 2). 

While the family of Dutch decent did not demonstrate any consanguinity, the 
proband in TD-2 was the offspring of a first-cousin consanguineous marriage (Fig. 
IB). We postulated, therefore, that it was most likely that this proband would be 
homozygous for the mutation while the proband in the Dutch family was likely to be a 

25 compound heterozygote. The Dutch proband shows completely different 
mutation bearing haplotypes, supporting this hypothesis (Fig. 3C). 

The TD-2 proband was homozygous for all markers tested (Fig. IB) distal to 
D9S127 but was heterozygous at D9S127 and DNA markers centromeric to it (Fig. 
3C). This suggested that the gene for TD was likely located to the genomic region 

30 telomeric of D9S 127 and encompassed by the markers demonstrating homozygosity 
(Fig.3B). 
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Two Po!nt Linkage Analysis of TD-1 andTO-2 
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Results of paiwsc linkage analysis using MLINK. Values correspond to the LOD score for linkage 
between the disease locus and a marker locus for specified values of the rccombinaiaon fraction. 
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TABLE 2. MlcrosateMit* markers u^d in this sludy 
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Alleie frequency' 
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0.87 
0,62 
0.86 
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0^4 
OJl 
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0.4? 
0.77 
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0.62 

o.a} 

0.78 
0.62 



10 
9 
3 
15 
6 
13 
11 
15 

lua. 
. nJL 

tua. 
5 
3 
3 
12 
A 
10 



10 
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Umosf 133(ai:M.-7cO.;3): I39<0.0J): I41t0i»): 1^.^0.05); 
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l-t9{0.l \% IS UQXny. WXO^v l J5<0.05TC 157(0.45): I39(Oj06) - 
112(0.061: »a*(OLO«); 12«(0JBi: ::4(0.0:i: 126(0.00 ^' 

1 7*4atD)i 176(0.02); I7S«l.eO>: 130(0.05* H2tO,M): I84<0J2>J ISflfO 151' t«SlO(VS»- 
JM(O.0ai: 192(0.07): IWtOXli: i9fi(OJ07): 198(0,01): aoJoSn iSSxi)* 
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29K0XM): 297ia0a); 2?9(0j;): 303(a«::: 303(OJJ2) 
K0.*21: 2(0 J6); 3(0.03) 

[«l(0.04): lfi3(0.02): IWWiaj: 169(0,oa): 171(0.10); 17310.09): n3{C,l3): I77(0J8): 
119(0.59): iai(O.O4)ilB3(0j0I):t83(0.Ol) ««,*;>M/™5J. 
1 10(aO2): 1 12(0 J3): 116(0^*); 1 18(0.07) ' 

iwSoah I81t0.021:l»4<0.1«: U«|0.40); l«B(0.1i): 150(0.0*): 

9(H0.02):92(QJ2);9*(0.02):H(0.02): 100(0^9); 102(0.04)! i04t0.08) 
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34*<0.09);246(Qai); 248(0^, 230tOJ9y. 232(0.03); 234<0JB9) 

190(0 J I); i96(a07); t9l«J2): 200(0.091 



n,a^« BO i'ms^jI^^^^"**'*"" Canadian or Ffcnch descent (J. Weissenbach. Psnonnai Communicadon 1993), 

pcse polymorphic microsatcUite markers were used for DNa typing in the region of 9q31 seen 
m -nic majority come from the last version of the Gcn^thon humim linkage map ». The 
trcqucucy of heterozygosity, the tiumber of alleles as well as the allele frequency of each marker 
are presented. j *t 
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Mutation detection 

Based on the defect in intracellular cholesterol transport in patients with TD, 
we reviewed the EST database for genes in this region which might be relevant to 
playing a role in this process. One gene that we reviewed as a candidate was the 
5 lysophosphatidic acid (LP A) receptor (EDG2) which m^ped near D9S 1 80 1 (Fig. 
3C). This receptor binds LPA and stimulates phospholipase-C (PLC), and is 
expressed in fibroblasts. It has previously been shown that the coordinate regulation 
of PLC that is necessary for normal HDL3 mediated cholesterol efflux is impaired in 
TD (Walter et aL, J. Clin. Invest. 98:2315-2323, 1996). Therefore this gene 

10 represented an excellent candidate for the TD gene. Detailed assessment of this gene, 
using Northern blot and RT-PCR and sequencing analysis, revealed no changes 
segregating with the mutant phenotype in this family, in all likelihood excluding this 
gene as the cause for TD. Polymorphisms were detected, however, in the RT-PCR 
product, mdicating expression of transcripts from both alleles. 

15 The second candidate gene (RGS3) encodes a member of a family regulating G 

protein signaling which could also be involved in influencing cholesterol efflux 
(Mendez et al.. Trans. Assoc, Amer. Phys. 104:48-53, 1991). This gene mapped 0.7 
cM telomeric to the LPA-receptor (Fig. 3C), and is expressed in fibroblasts. It was 
assessed by exon-specific amplification, as its genomic organization was published 

20 (Chatterjee et al.. Genomics 45:429-433, 1997). No significant sequence changes 
were detected. 

The ABCI transporter gene had previously been mapped to 9q3 1 , but its 
precise physical location had not been determined (Luciani et al. Genomics 21:150- 
1 59, 1 994), The ABCI gene is a member of the ATP binding cassette transporters 

25 which represents a super family of highly conserved proteins involved in membrane 
transport of diverse substrates including amino acids, peptides, vitamins and steroid 
hormones (Luciani et aL, Genomics 21 :150-159, 1994; Dean et al., Curr. Opin. Gen, 
Dev, 5:779-785, 1995). Primers to the 3' XJTR of this gene mapped to YACs 
spanning D9S306 (887-B2 and 930-D3) compatible with it being a strong candidate 

30 forTD, We initiated large scale genomic sequencing of BAGs spanning 

approximately 800 kb around marker D9S306 (BACs 269, 274, 279 and 291) (Fig 
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3E), The ABCI gene was revealed encompassing 49 exons and a inininium of 75 Kb 
of genomic sequence. In view of the potential fimction of a gene in this family as a 
cholesterol transporter^ its expression in fibroblasts and localization to the minimal 
genomic segment underlying TD, we formally assessed ABCI as a candidate. 
5 Patient and control total fibroblast RNA was used in Northern blot analysis 

and RT-PCR and sequence analyses. RT-PCR and sequence analysis of TD-1 
revealed a heterozygous T to C substitution (Fig. 4A) in the TD-1 proband, which 
would result in a substitution of arginine for cysteine at a conserved residue between 
mouse and man (Fig, 4B). This mutation, confirmed by sequencing exon 30 of the 

10 ABCI gene, exhibited complete segregation with the phenotype on one side of this 

family (Fig, 4C)* This substitution creates a Hgal site, allowing for RFLP analysis of 
amplified genomic DNA and confirmation of the mutation (Fig. 4C), The point 
mutation in exon 30 was not seen on over 200 normal chromosomes from unaffected 
persons of Dutch decent, and 250 chromosomes of Westem European decent, 

15 indicating it is unlikely to be a polymorphism. Northern blot analysis of fibroblast 
RNA from this patient, using a cDNA encompassing exons 1 to 49 of the gene, 
revealed a normal sized ~8 Kb transcript and a truncated mutant transcript which was 
not visible in control RNA or in RNA firom other patients with HDL deficiency (Fig. 
4D). Additionally, Northern blot analysis using clones encompassing discrete regions 

20 of the cONA revealed that the mutant transcript was detected with a cDNA 

compassing exons 1 to 49 (a), 1 to 41 (b), 1 to 22 (c), much more faintly with a probe 
spanning exon 23 to 29 (d) and not seen with probes encompassing exons 30 to 42 (e), 
but not seen with cDNA fragment spanning exons 30 to 49 (0- This was repeated on 
multiple filters with control RNA, RNA from other patients with HDL deficiency and 

25 the other TD proband, and only in TD-1 was the truncated transcript observed. 

Sequence ansdysis of the coding region did not reveal an alteration in sequence that 
could account for this finding. Furthermore, DNA analysis by Southern blot did not 
reveal any major rearrangements. Completion of exon sequencing in genomic DNA 
showed that this mutation was a G to C transversion at position (4-1) of intron 24, 

30 (Fig. 11) affecting a splice donor site and causing aberrant splicing. 

RT-PCR analysis of fibroblast RNA encoding theABCJ gene from the 
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proband in TD-2 (Fig. IB) revealed a homozygous nucleotide change of A to G at 
nucleotide 1864 of SEQ ID NO: 2 in exon 13 (Fig. 5 A), resulting in a substitution of 
arginine for glutamine at residue 597 of SEQ ID NO: 1 (Fig. 5B), occurring just 
proximal to the first predicted transmembrane domain of ABCl (Fig. 8) at a residue 
5 conserved in mouse and as well as a C elegans homolog. This mutation creates a 
second Acil site within exon 13. Segregation analysis of the mutation in this fandly 
revealed complete concordance between the mutation and the low HDL phenotype as 
predicted (Fig 5C). The proband in TD-2 is homozygous for this mutation, consistent 
with our expectation of a disease causing mutation in this consanguineous family. 

10 

Analysis of FHA families 

Linkage analysis and refinement of ike minimal genomic region containing the 
gene for FHA 

Data from microsatellite typing of individual family members from the four 

15 pedigrees of French Canadian origin were analyzed (Fig. 2), A maximum LOD score 
of 9.67 at a recombination fraction of 0.0 was detected at D9S277 on chromosome 
9q31 (Fig. 3; Table 3). Thereafter, 22 markers were typed in a region spanning 10 cM 
around this locus in these families (Figs. 2 and 3). The frequency for these markers 
were estimated from a sample of unrelated and unaffected subjects of French ancestry 

20 (Table 2). 

TD and FHA have thus far been deemed distinct with separate clinical and 
biochemical characteristics. Even though the genes for these disorders m^ped to the 
same region, it was uncertain whether FHA and TD were due to mutations in the same 
gene or, alternatively, due to mutations in genes in a similar region. Refinement of 

25 the region containing the gene for FHA was possible by examining haplotype sharing 
and identification of critical recombination events (Fig. 2). Seven separate meiotic 
recombination events were seen in these families ("A" through "G" in Figs. 2 and 3), 
clearly indicating that the minimal genomic region containing the potential disease 
gene was a region of approximately 4.4 cM genomic DNA spanned by marker 

30 D9S1690 and D9S1 866 (Figs. 2 and 3). This region is consistent with the results of 
two point linkage analysis which revealed maximal LOD scores with markers 
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TABLE 3 
Two Point Linkage Analysts in FHA 

10 
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Results of pairwise linkage analysis using MLINK. Values cofTcspcnd lo the LOD score. for linkage 
between the disease locus and a marks: locus for specified values of the recombination fraction. 
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D9S277 and D9S306 and essentially excluded the region: centromeric to D9S1690 or 
telomeric to D9S1866. An 8* meiotic recombination event ("H" in Fig, 3) further 
refined the FHA region to distal to D9S277. 

As described herein, the ABC! gene mapped within this interval. The 

5 overlapping genetic data strongly suggested that FHA may in fact be allelic to TD. 

Utilization of sets of genetic data from FHA and TD provided a telomeric boundary at 
D9S1866 (meiotic recombinant) (Fig. 3D) and a centromeric marker at D9S127 based 
on the homozygosity data of TD-2. This refined the locus to approximately 1 Mb 
between D9S127 and D9S1S66. The ABC J gene mapped within this minimal region 

10 (Fig. 3E). 



30 



Mutation detection in FHA 

Mutation assessment of the ABC J gene was undertaken in FHA-1 (Fig. 2 A). 
25 Using primers that spanned overlapping segments of the mRNA we performed RT- 

15 PGR analysis and subjected these fragments to mutational analysis. A deletion of 
three nucleotides is evident in the RT-PCR sequence of FHA-l m.Ol (Fig. 6A), 
resulting in a loss of nucleotides 2151-2153 of SEQ ID NO: 2 and deletion of a 
leucine (AL693) at amino acid position 693 of SEQ ID NO: I (Fig. 6A). This leucine 
is conserved in mouse and C. elegans (Fig. 6B), The alteration was detected in the 
20 RT-PCR products as well as in genomic sequence from exon 14 specific 
35 amplification. This mutation results in a loss of an Earl restriction site. Analysis of 

genomic DNA from the family indicated that the mutation segregated completely with 
the phenotype of HDL deficiency. The loss of the Earl site results in a larger 
fragment being remaining in persons heterozygous for this mutation (Fig. 6C). This 

40 

25 mutation maps to the first putative transmembrane domain of ABCl (Fig. 8) and was 
not seen in 130 chromosomes from persons of French Canadian descent nor seen in 
over 400 chromosomes from persons of other Western European ancestry. 
45 A mutation has also been found in patient genomic DNA in pedigree FHA-3 

from Quebec. The alteration, a 6 bp deletion of nucleotides 5752-5757 of SEQ ID 

30 NO: 2 within exon 41, results in a deletion of amino acids 1893 (Glu) and 1894 (Asp) 
of SEQ ID NO: 1- The deletion was detected as a double, superimposed, sequence 
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starting from the point of the deletion (Fig. 6D), and was detected in sequence reads in 
both directions. The deletion can be detected on 3% agarose or 10% polyacrylamide 
gels, and segregates with disease in FHA-3. It was not seen in 128 normal 
chromosomes of French-Canadian origin or in 434 other control chromosomes. 
5 Amino acids 1 893 and 1 894 are in a region of the ABC 1 protein that is conserved 
between hmnan, mouse, and C. elegans (Fig. 6E), implying that it is of functional 
importance. 

An additional mutation has been found in patient genomic DNA in pedigree 
FHA-2 from Quebec (Fig. 6F). The alteration, a C to T transition at position 6504 of 
10 SEQ ID NO: 2, converts an arginme at position 2144 of SEQ ID NO: 1 to a STOP 
codon, causing truncation of the last 118 amino acids of the ABCl protein. This 
alteration segregates with disease in family FHA-2. 

A summary of all mutations and polymorphisms found in ABCl is shown in 
Fig. 1 1 . Each variant indicated as a mutation segregates with low HDL in its family, 
15 and was not seen in several hundred control chromosomes. 

Functional relationship between changes in ABCl transcript levels and 
cholesterol efflux 

Antisense approaches were xmdertaken to decrease the ABCl transcript and 
20 assess the effect of alteration of the transcript on intracellular cholesterol transport. 

The use of antisense primers to the 5 * end of ABCl clearly resulted in a decrease to 
approximately 50% of normal RNA levels (Fig. 7A). This would be expected to 
mimic in part the loss of function due to mutations on one allele, similar to that seen 
in heterozygotes for TD and patients with FHA. Importantly, reduction in the mRNA 
25 for the ABCl gene resulted in a significant reduction in cellular cholesterol efflux 
(Fig. 7B), further establishing the role of this protein in reverse cholesterol transport 
and providing evidence that the mutations detected are likely to constitute loss of 
function mutations. Furthermore, these data support the functional importance of the 
first 60 amino acids of the protein. Antisense oligonucleotide AN-6 is directed to the 
30 novel start codon 5' to the one indicated in AJ012376.1; this antisense oligonucleotide 
effectively suppresses efflux. 
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The above-described results were obtained using the following materials and 
methods. 

Patient selection 

5 The probands in TD families had previously been diagnosed as suffering from 

TD based on clinical and biochemical data. Study subjects with FHA were selected 
from the Cardiology Clinic of the Clinical Research Institute of Montreal, The main 
criterion was an HDL-C level <5th percentile for age and gender, with a plasma 
concentration of triglycerides <95th percentile in the proband and a first-degree 
10 relative with the same lipid abnormality. In addition, the patients did not have 
diabetes. 

Biochemical studies 

Blood was withdrawn in EDTA-containing tubes for plasma lipid, lipoprotein 
15 cholesterol, ApoAI, and triglyceride analyses, as well as storage at 

-80**C. Leukocytes \vere isolated from the buffy coat for DNA extraction. 

Lipoprotein measurement was perfomied on firesh plasma as described 

elsewhere (Rogler et al., Arterioscler. Thromb. Vase, Biol. 15:683-690, 1995). The 

laboratory participates and meets the criteria of the Lipid Research Program 
20 Standardization Program. Lipids, cholesterol and triglyceride levels were determined 

in total plasma and plasma at density d<l .006 g/mL (obtained after preparative 

ultracentrifugation) before and after precipitation with dextran manganese. 

Apolipoprotein measurement was performed by nephelometry for ApoB and ApoAI. 

25 Linkage analysis 

Linkage between the trait locus and microsatellite loci was analyzed using the 
FASTLINK version (4.0 P). FASTLINK/MLINK was used for two-point linkage 
analysis assuming an autosomal dominant trait widi complete penetrance. In FHA 
and TD heterozygotes, the phenotype was HDL deficiency <5th percentile for age and 

30 sex. The disease allele frequency was estimated to be 0.005. Marker allele 

frequencies were estimated from the genotypes of the founders in the pedigrees using 



-32- 



wo 00/55318 



PCT/IB0O/00S32 



NEWPREP. Multipoint linkage analysis was carried out using 
FASTLINK/LINKMAP. 

Genomic clone assembly and physical map construction of the 9q31 region 
5 Using the Whitehead Institute/MIT Center for Genome Research map as a 

reference, the genetic markers of interest at 9q31 were identified within YAC contigs. 
Additional markers that mapped to the approximate 9q3 1 interval from public 
databases and the literature were then assayed against the YAC clones by PCR and 
hybridization analysis. The order of markers was based on their presence or absence 

10 in the anchored YAC contigs and later in the BAC contig. Based on the haplotype 
analysis, the region between D9S277 and D9S306 was targeted for higher resolution 
physical mapping studies using bacterial artificial chromosomes (BACs). BACs 
within the region of interest were isolated by hybridization of DNA marker probes 
and whole YACs to high-density filters containing clones from the RPCl-1 1 human 

15 BAC library (Fig, 3). 

Sequence retrieval and alignment 

The human ABCl mRNA sequence was retrieved from GenBank using the 
Entrez nucleotide query (Baxevanis et al., A Practical Guide to the Analysis of Genes 

20 and Proteins, eds. Baxevanis, A.D. & Ouellette, B.F.F, 98:120, 1998) as GenBank 

accession number AJO 12376. 1 , The version of the protein sequence we used as wild- 
type (normal) was CAA 1 0005 . 1 . 

We identified an additional 60 amino acids in-frame with the previously- 
believed start methionine (Fig, 9A). Bioinformatic analysis of the additional amino 

25 acids indicates the presence of a short stretch of basic amino acid residues, followed 
by a hydrophobic stretch, then several polar residues. This may represent a leader 
sequence, or another transmembrane or membrane-associated region of the ABCl 
protein. In order to differentiate among the foregoing possibilities, antibodies 
directed to the region of amino acids 1-60 are raised against and used to determine the 

30 physical relationship of amino acids 1-60 in relation to the cell membrane. Other 

standard methods can also be employed, including, for example, expression of fusion 
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proteins and cell fractionation. 

We also identified six errors in the previously-reported nucleotide sequence (at 
positions 839, 4738, 5017, 5995, 6557, and 6899 of SEQ ID NO: 2; Fig. 1 1). Hence, 
the sequence of the ABCl polypeptide of Fig. 9A differs from CAA10005.1 as 
5 follows: Thr«C>Ile at position 1554; Pro^Leu at position 1642; Arg=^Lys at position 
1973; and Pro^Leu at position 2167. We also identified 5' and 3' UTR seqnence 
(Figs. 9B - 9E). 

The mouse ABCl sequence used has accession number X75926. It is very 
likely that this mouse sequence is incomplete, as it lacks the additional 60 amino acids 

10 . described herein for human ABCl . 

Version 1.7 of ClustalW was used for multiple sequence alignments with 
BOXSHADE for graphical enhancement (http://www.isrec.isb-sib.ch: 8080/ 
software/BOX_form.html) with the default parameter. A Caenorhabditis elegans 
ABCl orthologue was identified with BLAST (version 2.08) using CAA1005.1 (see 

15 above) as a query, with the default parameter except for doing an organism filter for 
C. elegans. The selected protein sequence has accession version number 
AAC69223. 1 with a score of 375, and an E value of 1 03. 

Genomic DNA sequencing 

20 BAG DNA was extracted from bacterial cultures using NucleoBond Plasmid 

Maxi Kits (Clontech, Palo Alto, CA). For DNA sequencing, a sublibrary was first 
constructed from each of the BAG DNAs (Rowen et al., Automated DNA Sequencing 
and Analysis, eds. Adams, M.D., Fields, C. & Venter, J.C., 1994), In brief, the BAG 
DNA was isolated and randomly sheared by nebulization. The sheared DNA was 

25 then size fractionated by agarose gel electrophoresis and ft^gments above 2 kb were 
collected, treated with Mung Bean nuclease followed by T4 DNA polymerase and 
klenow enzyme to ensure blunt^ends, and cloned into Smal-cut M13mpl9. Random 
clones were sequenced with an ABI373 or 377 sequencer and fluorescently labeled 
primers (Applied BioSystems, Foster City, CA). DNAStar software was used for gel 

30 trace analysis and contig assembly. All DNA sequences were examined against 

available public databases primarily using BLASTn with RepeatMasker (University 
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of Washington)* 

Reverse transcription (RI)-PCR amplification and sequence analysis 

Total RNA was isolated from the cultured fibroblasts of TD and FHA patients, 
5 and reverse transcribed with a CDS primer containing oligo d(T)l 8 using 250 units of 
Superscript n reverse transcriptase (Life Technologies, Inc., Rockville, MD) as 
described (Zhang et aL, J. Biol. Chem. 27:1776-1783, 1996). cDNA was amplified 
with Taq DNA polymerase using primers derived from the published human ABCI 
cDNA sequence (Luciani et al.. Genomics 21 : 150-1 59, 1994), Six sets of primer pairs 

10 were designed to amplify each cDNA sample, generating six DNA fragments which 
are sequentially overlapped covering 135 to 7014 bp of the full-length human y4JSC7 
cDNA. The nucleotides are numbered according to the order of the published human 
cDNA sequence (AJ012376J). Primer pairs (1): 135-158 (f) and 1 183-1199 (r); (2): 
1080-1 107 (f) and 2247-2273 (r); (3): 2171-2197 (f) and 3376-3404 (r); (4): 3323- 

15 3353 (f) and 4587-4617 (r); (5) 4515-4539 (f) and 5782-581 1 (r); (6): 5742-5769 (f) 
and 6985-7014 (r). RT-PCR products were purified by Qiagen spin columns. 
Sequencing was carried out in a Model 3 73 A Automated DNA sequencer (Applied 
Biosystems) using Taq di-deoxy terminator cycle sequencing and Big Dye Kits 
according to the manufacturer's protocoL 

20 

Northern blot analysis 

Northern transfer and hybridizations were performed essentially as described 
(Zhang et al., J. BioL Chem. 27:1776-1783, 1996). Briefly, 20 jxg of total fibroblast 
RNA samples were resolved by electrophoresis in a denaturing agarose (1.2%; w/v) 
25 gel in the presence of 7% formaldehyde, and transferred to nylon membranes. The 
filters were probed with ^^P-labeled human ABCI cDNA as indicated. Pre- 
hybridization and hybridizations were carried out in an ExpressHyb solution 
(ClonTech) at 68'*C according to the manufacturer's protocol. 

30 Detection of the mutations in TD 

Genotyping for the T4503C and A1864G variants was performed by PGR 
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amplification of exon 30 followed by restriction digestion with Hgal and 
amplification of exon 13 followed by digestion with-^cil, respectively. PGR was 
carried out in a total volume of 50 |iiL with 1.5 mM MgCl2, 187.5 nM of each dNTP, 
2.5U Taq polymerase and 1 5 pmol of each primer (forward primer in exon 30: 5*- 
5 CTG CCA GGC AGG GGA GGA AGA GTG-3* (SEQ ID NO: 4); reverse primer 

spanning the junction of exon 30 and intron 30: 5'-GAA AGT GAG TCA CTT GTG 
GAG GA-3' (SEQ ID NO: 5); forward primer in intron 12: 5'-AAA GGG GCT TGG 
TAA GGG TA-3* (SEQ ID NO: 6); reverse in intron 13; 5'-CAT GCA CAT GCA 
CAC ACA TA -3' (SEQ ID NO: 7)). Following an initial denaturation of 3 minutes 

10 at 95**C, 35 cycles consisting of 95'='C 10 seconds, SS'^C 30 seconds, 72°C 30 seconds 
were performed, with a final extension of 10 minutes at 72**C. Por detection of the 
T4503C mutation, 15 jiL of exon 30 PCR product was incubated with 4 U Hgdi in a 
total volume of 25 ^L, for 2 hours at 37*'C, and the resulting fragments were separated 
on a 1 .5% agarose gel. The presence of the T4503C mutation creates a restriction site 

15 for Hgaly and thus the 1 94 bp PGR product will be cut into fragments of 1 34 and 60 
bp in the presence of the T4503C variant, but not in its absence. For detection of the 
A1864G mutation, \5 \iL of exon 13 PCR products were digested with 8 Acil for 
three hours at 37*^C. Products were separated on 2% agarose gels. The presence of 
the Ai 864G mutation creates a second Acil site within the PCR product. Thus, the 

20 360 bp PCR product is cleaved into fragments of 215 bp and 1 45 bp on the wild-type 
allele, but 185 bp, 145 bp and 30 bp on the mutant allele. 

Detection of mutation in FHA 

Genotyping for the A693 variant was performed by PCR amplification of exon 

25 14 followed by restriction enzyme digestion with Earl, PCR was carried out in a total 
volume of 80 with 1 .5 mM MgClz, 1 87.5 nM of each dNTP, 2,5 U Taq 
polymerase and 20 pmol of each primer (forward primer in exon 14: 5'- CTT TCT 
GCG GGT GAT GAG CCG GTG AAT-3' (SEQ ID NO: 8); reverse primer in intron 
14:5'.CGTTAGCCCGTGTTGAGGTA-3'(SEQIDNO:9)). Followingan 

30 initial denaturation of 3 minutes at 95*^C, 35 cycles consisting of 95°C 1 0 seconds, 
55°G 30 seconds, 72^C 30 seconds were perfomied, with a final extension of 10 
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minutes at 72^C, Twenty microliters of PGR product was incubated with 4 U Earl in 
a total volume of 25 jiL, for two hours at 37''C, and the ftagments were separated on a 
2 % agarose gel. The presence of the A693 tnutation destroys a restriction site for 
Earl, and thus the 297 bp PGR product will be cut into jfragments of 1 5 1 bp, 59 bp, 48 
bp and 39 bp in the presence of a wild-type allele, but only fragments of 210 bp, 48 bp 
and 39 bp in the presence of the deletion. 

A 6 bp deletion encompassing nucleotides 5752-5757 (inclusive), was 
detected in exon 41 in the proband of family FHA-3 by g^iomic sequencing using 
primers located within the introns flanking this exon, Genotyping of this mutation in 
family FHA-3 and controls was carried out by PGR with forward (5'-CGT GTA AAT 
GGA AAG GTA TCT CCT GT- 3' (SEQ ID NO: 10)) and reverse primers {5*-CGT 
CAA CTC err gat TTC TAA gat GT (SEQ ID NO: 1 1)) located near the 5' and 
3' ends of exon 41 , respectively. Each PGR was carried out as for the genotyping of 
the 693 variant, but with annealing temperature of SS^'C. Twenty microliters of PGR 
product was resolved on 3% agarose or 10% acrylamide gels. The wild type allele 
was detected as a 1 1 7 bp band and the mutant allele as a 1 1 1 bp band upon staining 
with ethidium bromide, 

A C to T transition was detected at nucleotide 6504 in genomic DNA of the 
proband of family FHA-2. It was detectable as a double C and T peak in the genomic 
sequence of exon 48 of this individual, who is heterozygous for the alteration. This 
mutation, which creates a STOP codon that results in truncation of the last 118 amino 
acids of the ABCl protein, also destroys an /?j<3l restriction site that is present in the 
wild type sequence. Genotyping of this mutation in family FHA-2 and controls was 
carried out by PGR with forward (5'-GGG TTC GGA GGG TTC AGT ATO') (SEQ 
ID NO: 12)) and reverse (5*-GAT GAG GAA TTC AAG GAG CAA-30 (SEQ ID 
NO; 13)) primers directed to the intronic sequences flanking exon 48. PGR was done 
as for the 693 variant. Fifteen microliters of PGR product was digested with 5 Units 
ofRsal at 37**C for two hours and the digestion products resolved on 1.5% agarose 
gels. The mutant allele is detected as an uncut 436 bp band. The normal sequence is 
cut by Rsal to produce 332 and 104 bp bands. 
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Cell culture 

Skin fibroblast cultures were established from 3 .0 mm punch biopsies of the 
forearm of FHD patients and healthy control subjects as described (Marcil et al,, 
Arterioscler. Thromb. Vase. Biol. 19:159-169, 1999). 

5 

Cellular cholesterol labeling and loading 

The protocol for cellular cholesterol efflux experiments was described in detail 
elsewhere (Maicil et al., Arterioscler. Thiomb. Vase. Biol. 19:159-169, 1999). The 
cells were ^H-cholesterol labeled dxiring growth and free cholesterol loaded in growth 
10 arrest. 



Cholesterol effiux studies 

Efflux studies were carried out from 0 to 24 hours in the presence of purified 
ApoAI (10 p.g protein/mL medium). Efflux was determined as a percent of free 
15 cholesterol in the medipm after the cells were incubated for specified periods of time. 
All experiments were performed in triplicate, in the presence of cells from one control 
subject and the cells from the study subjects to be examined. All results showing an 
efflux defect were confirmed at least three times. 

20 Oligonucleotide synthesis 

Eight phoj^horothioate deoxyoligonucleotides complementaiy to various 
regions of the human ABC! cDNA sequence were obtained from GIBCO BRL, The 
oligonucleotides were purified by HPLC, The sequences of the antisense 
oligonucleotides and their location are listed. One skilled in the art will recognize that 

25 other ABCl antisense sequences can also be produced and tested for their ability to 
decrease ABC l-mediated cholesterol regulation. 



Name 


Sequence (5'- 3') 


m"RNA target 


% control 


AN-1 


GCAGAGGGCATGGGTTTATrTG (SEQ ID NO: 3) 


AUG codoh 


46 


AN-2 


GTGTrCCTGCAGAGGGCATG (SEQ ID NO: 30) 


AUG codon 


50 


AN'3 


CACrrCCAGTAACAGCTGAC (SEQ ID NO: 3 1) 


5*-Untranslated 


79 


AN-4 


CTTTGCGCATGTCCTTCATGC (SEQ ID NO: 32) 


Coding 


80 
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AN-5 GACATCAGCCCTCAGCATCrr(SEQIDNO:33) Coding 120 

AN-6: CAACAAGCCATGTTCCCTC (SEQ ID NO: 34) Coding 

AN-7: CATGTTCCCTCAGCCAGC(SEQIDNO:35) Coding 

AN-8: CAGAGCTCACAGCAGGGA C (SEQ ID NO: 36) Coding 

Cell transfection with antisense oligonucleotides 

Cells were grown in 35 mm culture dishes until 80 % confluent, then washed 
once with DMEM medium (serum and antibiotics free). One milliliter of DMEM 
(serum and antibiotics free) containing 500 nM antisense oligonucleotides and 5 
Hg/ml or 7.5 |tg/ml of lipofectin (GffiCO BRL) were added to each well according to 
the manufacturer's protocol. The cells were incubated at 37*^0 for 4 hours, and then 
the medium was replaced by DMEM containing 1 0% FCS. Twenty-four hours after 
the transfection, the total cell RNA was isolated. Ten micrograms of total RNA was 
resolved on a 1% of agarose-formaldehyde gel and transferred to nylon membrane. 
The blot was hybridized with a-^^P dCTP labeled hwmm ABC 1 cDNA overnight at 
eS^'C. The membrane was subsequently exposed to x-ray film. The hybridizing bands 
were scanned by optical densitometry and standard to 28S ribosome RNA. 

Cholesterol efflux with anti-ABC I oligonucleotides 

Human skin fibroblasts were plated in 6-well plates. The cells were labeled 
with ^H-cholesterol (0.2 ^tCi /ml) in DMEM with 1 0% FBS for two days when the 
cell reached 50% confluence. The cells were then transfected with the antisense 
ABCJ oligonucleotides at 500nM in DMEM (serum and antibiotic free) with 7.5 
^ig/ml Lipofectin (GIBCO BRL) according to the manufacturer's protocol. Following 
the transfection, and the cells were loaded with nonlipoprotein (20 ^g/ml) for 12 
hours in DMEM containing 2 mg/ml BSA without serum. The cellular cholesterol 
pools were then allowed to equilibrate for 6 hours in DMEM-BSA. The cholesterol 
efflux mediated by ApoAI (lOjig /ml, in DMEM-BSA) were then carried out which is 
48 hours after transfection. 

Radiolabeled cholesterol released into the medium is expressed as a 
percentage of total ^H-cholesterol per well (medium + cell ). Results are the mean +/- 
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SD of triplicate dishes. 

Determination of genomic structure of the ABC! gene 

Most splice junction sequences were detenmned from genomic sequence 
5 generated from BAG clones spanning the ABC 1 gene. More than 1 60 kb of 

genomic sequence were generated. Genomic sequences were aligned with cDNA 
sequences to identify intron/exon boundaries. In some cases, long distance PGR 
between adjacent exotis was used to amplify intron/exon boimdary sequences using 
amplification primers designed according to the cDNA sequence. 

10 

Functionality of the newly-discovered 60 amino acids at the N-terminus 
Antisense experiments 

Phosphorothioate antisentisense oligonucleotides were designed to be 
complementary to the regions of the cDNA near newly discovered translation start 

15 site. AN-6 and AN-7 both overlap the initiator methionine codon; this site is in the 
middle of oligonucleotide AN-6. AN-8 is complementary to the very 5' end of the 
ABCl cDNA. Antisense oligonucleotide AN-1 is complementary to the region of the 
ABCl cDNA corresponding to the site identified as the ABCl initiator methionine in 
AJ012376. Fig. 7G shows that antisense oligonucleotide AN-6 interferes with cellular 

20 cholesterol efflux in normal fibroblasts to the same extent as does antisense 

oligonucleotide AN-1 . Transfection with either of these antisense oligonucleotides 
results in a decrease in cellular cholesterol efflux almost as severe as that seen in FELA. 
cells. In general, antisense oligonucleotides complementary to coding sequences, 
especially near the 5* end of a gene's coding sequence, are expected to be more 

25 effective in decreasing the effective amount of transcript than are oligonucleotides 
directed to more V sequences or to non-coding sequences. The observation that 
AN-6 depresses cellular cholesterol efflux as effectively as AN-1 implies that both of 
these oligonucleotides are complementary to ABCl coding sequences, and that the 
amino terminal 60 amino acids are likely to be contained in ABCl protein. In 

30 contrast, the ineffectiveness of AN-8 shows that it is likely to be outside the protein 
coding region of the transcript, as predicted by presence of an in-frame stop codon 
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between the initiator methionine and the region targeted by AN-8. 
Antibody expgrimcntfi 

Polyclonal and monoclonal antibodies have been generated using peptides 
5 corresponding to discrete portions of the ABCl amino acid sequence. One of these, 

20-amino acid peptide #2 (Pep2: CSVRLSYPPYEQHECHFPNKA (SEQ ID NO: 37), 
in which the N-terminal cysteine was added to facilitate conjugation of the peptide) 
corresponds to a protein sequence within the 60 amino-tenninal amino acids of the 
newly-discovered ABCl protein sequence. The peptide was coupled to the KLH 

10 . carrier protein and 300 |xg injected at three intervals into two Balb/c mice over a four 
week period. The spleen was harvested from the mouse with the highest 
ELISA-determined immune response to free peptide, and the cells fused to NS-1 
myeloma cells by standard monoclonal antibody generation methods. Positive 
hybridomas were selected first by ELISA and then further characterized by western 

15 blotting using cultured primary human fibroblasts. Monoclonal cell lines producing a 
high antibody titre and specifically recognizing the 245 kD human ABCl protein 
were saved. The same size ABCl protein product was detected by antibodies directed 
to four other discrete regions of the same protein. The 245 kD band could be 
eliminated in competition experiments with appropriate free peptide, indicating that it 

20 represents ABCl protein (Fig. 13). 

The foregoing experiments indicate that ABCl protein is detected not only by 
antibodies corresponding to amino acid sequences within the previously-described 
ABCl amino acid sequence, but also by the Pep2 monoclonal antibody that 
recognizes an epitope within the newly-discovered N-terminal 60 amino acids. The 

25 N-terminal 60 amino acid region is therefore codings and is part of the ABCl protein* 
The epitope recognized by the Pep2 monoclonal antibody is also conserved 
among human, mouse, and chicken. Liver tissues from these three species employed 
in a Western blot produced an ABCl band of 245 kD when probed with the Pep2 
monoclonal antibody. This indicates that the 60 amino acid N-terminal sequence is 

30 part of the ABCl coding sequence in humans, mice, and chickens. Presence of this 
region is therefore evolutionarily conserved and likely to be of important fixnctional 
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significance for the ABCl protein. 



Bioinformatic analyses of ABCl protein sequences 

Transmembrane prediction programs indicate 13 transmembrane (TM) 

5 regions, the first one being between amino acids 26 and 42 

(http://psort,inbb.ac.jp:8800/psort/helpwww2.html#ealom). The tentative number of 
TM regions for the threshold 0.5 is 13. (INTEGRAL Likelihood -7.75 
Transmembrane 26-42). The other 12 TM range in value between -0.64 and -12 (full 
results below). It is therefore very likely that the newly-discovered 60 amino acids 

10 contain a TM domain, and that the amino end of ASCI may be on the opposite side of 
the membrane than originally thought. 



25 



30 



35 



15 



20 



25 



30 



ALOM: TM region allocation 
Init posidon for calculation: 1 
Tentative number ofTMs for the threshold 0.5: 13 

INTEGRAL Likelihood -7.75 Transmembrane 26- 

Likelihood - -3.98 Transmembrane 640- 

Likelifaood = -8.70 Transmembrane 690- 
LikeIihood^-9.6l 



INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 
INTEGRAL 

PERIPHERAL Likelihood 



Transmembrane 717- 
Likelihood «-1.44 Transmembrane 749 - 
Likelihood = -0.64 Transmembrane 771- 
Likelihood--1.28 Transmembrane 1041 - 
Likelihood =-12,79 Transmembrane 1351 
Likelihood = -8.60 Transmembrane 1661 - 
Likelihood « -6.79 Transmembrane 1708 - 
Likelihood = -3.40 Transmembrane 1737 - 
Likelihood - - 1 .49 Transmembrane 1775 - 
Likelihood ^ -8.39 Transmembrane 1854 - 
0.69 (at 1643) 



42 
656 
706 
733 
765 
787 
1057 
-1367 
■1677 
1724 
1753 
-1791 
■1870 



ALOM score: -12.79 (number of TMSs: 13) 



40 



45 



SO 



There does not appear to be an obvious cleaved peptide, so this first 60 amino 
acid residues are not likely to be cleaved, and are therefore not specifically a 
signal/targeting sequence. No other signals (e.g„ for targeting to specific organelles) 
35 are apparent* 

Agonists aTid Antagonists 

Useful therapeutic compounds include those which modulate the expression, 
activity, or stability of ABCl. To isolate such compounds, ABCl expression, 
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biological activity, or regulated catabolism is measured following the addition of 
candidate compounds to a culture medium of ABCl-eKpressiag cells. Alternatively, 
the candidate compounds may be directly administered to animals (for example mice, 
pigs, or chickens) and used to screen for their effects on ABCl expression. 
5 In addition its role in the regulation of cholesterol, ABCl also participates in 

other biological processes for which the development of ABCl modulators would be 
useful. In one example, ABCl transports interleukin-ip (IL-ip) across the cell 
membrane and out of cells. IL-ip is a precursor of the inflammatory response and, as 
such, inhibitors or antagonists of ABCl expression or biological activity may be 

10 useful in the treatment of any inflammatory disorders, including but not limited to 

rheumatoid arthritis, systemic lupus crythematosis (SLE), hypo- or hyper- thyroidism, 
inflammatory bowel disease, and diabetes mellitus. In another example, ABCl 
expressed in macrophages has been shown to be engaged in the engulfinent and 
clearance of dead cells. The ability of macrophages to ingest tiiese apoptotic bodies is 

15 impaired after antibody-mediated blockade of ABCl. Accordingly, compounds that 
modulate ABCl expression, stability, or biological activity would be useful for the 
treatment of these disorders. 

ABCl expression is measured, for example, by standard Northern blot 
analysis using an ABC J nucleic acid sequence (or fragment thereof) as a hybridization 

20 probe, or by Western blot using an anti-ABC 1 antibody and standard techniques. The 
level of ABCl expression in the presence of the candidate molecule is compared to 
the level measured for the same cells, in the same culture medium, or in a parallel set 
of test animals, but in the absence of the candidate molecule. ABCl activity can also 
be measured using the cholesterol efflux assay. 

25 

Transcnptinnal Regulation of ABCt Expression 

ABCl mRNA is increased approximately 8-fold upon cholesterol loading. 
This increase is likely controlled at the transcriptional level. Using the promoter 
sequence described herein, one can identify transcription factors that bind to the 
30 promoter by performing, for example, gel shift assays, DNAse protection assays, or in 
vitro or in vivo reporter gene-based assays. The identified transcription factors are 
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themselves drug targets. In the case of ABC 1, drug compounds that act through 
modulation of transcription of ABCl could be used for HDL modulation, 
atherosclerosis prevention, and the treatment of cardiovascular disease. For example, 
using a compound to inhibit a transcription factor that represses ABCl would be 
5 expected to result in up-regulation of ABCl and, therefore, HDL levels. In another 
example, a compound that increases transcription factor expression or activity would 
also increase ABCl expression and HDL levels. 

Transcription factors known to regulate other genes in the regulation of 
apolipoprotein genes or other cholesterol- or lipid-regulating genes are of particular 

10 relevance. Such factors include, but are not limited to, the steroid response element 
binding proteins (SREBP-1 and SREBP-2), the PPAR (peroxisomal 
proliferation-activated receptor) transcription factors. Several consensus sites for 
certain elements are present in the sequenced region 5' to the ABCl gene (Fig. 16) 
and are likely to modulate ABCl expression. For example, PPARs may alter 

15 transcription of ABCl by mechanisms including heterodimerization with retinoid X 
receptors (RXRs) and then binding to specific proliferator response elements 
(PPREs). Examples of such PPARs include PPARa, p, y and 5. These distmct 
PPARs have been shown to have transcriptional regulatory effects on different genes. 
PPARa is expressed mainly in liver, whereas PPARy is expressed in predominantly in 

20 adipocytes. Both PPARa and PPARy are found in coronary and carotid artery 

atherosclerotic plaques and in endothelial cells, smooth muscle cells, monocytes and 
monocyte-derived macrophages. Activation of PPARa results in altered lipoprotein 
metabolism through PPARa' s effect on genes such as lipoprotein lipase (LPL), 
apolipoprotein Cin (apo CIII) and apolipoprotein AI (^o AI) and AH (apo All). 

25 PPARa activation results in overexpression of LPL and apoA-I and apoA-II, but 
inhibits the expression of apo CIIL PPARa activation also inhibits inflammation, 
stimulates lipid oxidation and increases the hepatic uptake and esterification of free 
fatty acids (FFA's). PPARa and PPARy activation may inhibit nitric oxide (NO) 
synthase in macrophages and prevent interleukin-1 (IL-1) induced expression of IL-6 

30 and cyclo-oxygenase-2 (COX-2) and thrombin induced endothelin-1 expression 

secondary to negative transcriptional regulation of NF-KB and activation of protein- 1 
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signaling pathway. It has also been shown that PPARa induces s^ptosis in 
monocyte-derived macrophages through the inhibition of NF-KB activity. 

Activation of PPARa can be achieved by compounds such as fibrates, 
P-estradiol, arachidonic acid derivatives. Wy-14,643 and LTB4 or 8(s)HETE, 
5 PPARy activation can be achieved through compounds such as thiozolidinedione 

antidiabetic drugs, 9-HODE and 13-HODE. Additional compounds such as nicotinic 
acid or HMG CoA reductase inhibitors may also alter the activity of PPARs. 

Compounds which alter activity of any of the PPARs (e.g., PPARa or PPARy) 
may have an effect on ABCl expression and thereby could affect HDL levels, 

10 atherosclerosis and risk of CAD. PPARs are also regulated by fatty acids (including 
modified fatty acids such as 3 thia fatty acids), leukotrienes such as leukotriene B4 
and prostaglandin J2, which is a natural activator/ligand for PPARy, Drugs that 
modulate PPARs may therefore have an important effect on modulating Hpid levels 
(including HDL and triglyceride levels) and altering CAD risk. This effect could be 

15 achieved through the modulation of ABCl gene expression. Drugs may also effect 
ABCl gene expression and thereby HDL levels, by an indirect effect on PPARs via 
other transcriptional factors such as adipocyte differentiation and determination 
factor-1 (ADD-1) and sterol regulatory element binding protein-1 and 2 (SREBP-1 
and 2). Drugs with combined PPARa and PPARy agonist activity or PPARa and 

20 PPARy agonists given in combination for example, may increase HDL levels even 
more. 

A PPAR binding site (PPRE element) is found 5' to the ABCl gene 
(nucleotides 2150 to 2169 of SEQ ID NO: 14). Like the PPRE elements found in the 
C-ACS, HD, CYP4A6 and ApoA-I genes, this PPRE site is a trimer related to the 

25 PPRE consensus sequence. Partly because of its similarity in the nimiber and 

arrangement of repeats in this PPAR binding site, this element in particular is very 
likely to be of physiological relevance to the regulation of the ABCl gene. 

Additional transcription factors which may also have an effect in modulating 
ABCl gene expression and thereby HDL levels, atherosclerosis and CAD risk 

30 include; REV-ERBa, SRBBP-1 & 2, ADD-1 , EBPa, CREB binding protein, P300, 

HNF 4, RAR, LXR, and RORa. Additional degenerate binding sites for these factors 
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can be found through examination of the sequence in SEQ ID NO: 14* 

AHriitinnal utility of ABCl polypeptides, nucleic acids, and modulators 

ABCl may act as a transporter of toxic proteins or protein fragments {e.g.^ 
5 APP) out of cells. Thus, ABCl agonists/upregulators may be useful in the treatment 
of other disease areas, including Alzheimer's disease* Niemann-Pick disease, and 
Huntington's disease. 

ABC transporters have been shown to increase the uptake of long chain fatty 
acids from the cytosol to peroxisomes and, moreover, to play a role in P-oxidation of 
10 very long chain fatty acids. Importantly, in x-linked adrenoleukodystrophy (ALD), 

fatty acid metabolism is abnormal, due to defects in the peroxisomal ABC transporter. 
Any agent that upregulates ABC transporter expression or biological activity may 
therefor be useful for the treatment of ALD or any other lipid disorder. 

ABCl is expressed in macrophages and is required for engulfinent of cells 
15 undergoing programmed cell death. The ^optotic process itself, and its regulation, 
have important implications for disorders such as cancer, one mechanism of which is 
failure of cells to undergo cell death appropriately. ABCl may facilitate apoptosis, 
and as such may represent an intervention point for cancer treatment. Increasing 
ABCl expression or activity or otherwise up-regulating ABCl by any method may 
20 constitute a treatment for cancer by increasing apoptosis and thus potentially 
decreasing the aberrant cellular proliferation characterized by this disease. 
Conversely, down-regulation of ABCl by any method may provide opportunity for 
decreasing apoptosis and allowing increased proliferation of cells in conditions where 
cell growth is limited* Such disorders include but are not limited to neurodeficiencies 
25 and neurodegeneration, and growth disorders. ABCl could, therefore, potentially be 
used as a method for identification of compounds for use in the treatment of cancer, or 
in the treatment of degenerative disorders. 

Agents that have been shown to inhibit ABCl include, for example, the anti- 
diabetic agents glibenclamide and glyburide, flufenamic acid, diphenylamine-2- 
30 carbonic acid, sulfobromophthalein, and DIDS. 

Agents that upregulate ABCl expression or biological activity include but are 
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not limited to protein kinase A, protein kinase C, vanadate^ okadaio acid, and IBMXl. 

Those in the art will recognize that other compounds can also modixlate ABC 1 
biological activity, and these compounds are also in the spirit of the invention. 

5 Drag scree ns based on the ABCl gene qr prp^ ^ii l 

The ABCl protein and gene can be used in screening assays for identification 
of compounds which modulate its activity and may be potential drugs to regulate 
cholesterol levels. Useful ABCl proteins include wild-type and mutant ABCl 
proteins or protein fragments, in a recombinant form or endogenously expressed. 

10 Drug screens to identify compounds acting on the ABCl expression prodtict may 
employ any functional feature of the protein. In one example, the phosphorylation 
state or other post-translational modification is monitored as a measure of ABCl 
biological activity, ABCl has ATP binding sites, and thus assays may wholly or in 
part test the ability of ABCl to bind ATP or to exhibit ATPase activity. ABCl , by 

15 analogy to similar proteins, is thought to be able to form a chaimel-like structure; drug 
screening assays could be based upon assaying for the ability of the protein to form a 
channel, or upon the ability to transport cholesterol or another molecule, or based 
upon the abihty of other proteins bound by or regulated by ABCl to form a channel. 
Alternatively, phospholipid or lipid transport can also be used as measures of ABCl 

20 biological activity. 

There is evidence that, in addition to its role as a regulator of cholesterol 
levels, ABCl also transports anions. Functional assays could be based upon this 
property, and could employ drug screening technology such as (but not limited to) the 
ability of various dyes to change color in response to changes in specific ion 

25 concentrations in such assays can be performed in vesicles such as liposomes, or 
adapted to use whole cells. 

Drug screening assays can also be based upon the ability of 7\BC1 or other 
ABC transporters to interact with other proteins. Such interacting proteins can be 
identified by a variety of methods known in the art, including, for example, 

30 radioimmunoprecipitation, co-immxinoprecipitation, co-purification, and yeast two- 
hybrid screening. Such interactions can be further assayed by means including but 
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not limited to fluorescence polarization or scintillation proximity methods. Drug 
screens can also be based upon functions of the ABCl protein deduced upon X-ray 
crystallography of the protein and comparison of its 3-D structure to that of proteins 
with known functions. Such a crystal structure has been determined for the 

5 prokaryotic ABC family member HisP, histidine permease. Drug screens can be 

based upon a function or feature apparent upon creation of a transgenic or knockout 
mouse, or upon overexpression of the protein or protein fragment in mammalian cells 
in vitro. Moreover, expression of mammaJian (e.g., human) ABC 1 in yeast or C 
elegans allows for screening of candidate compounds in wild-type and mutant 

10 backgrounds, as well as screens for mutations that enhance or suppress an ABCl- 

dependent phenotype. Modifier screens can also be performed in ABCl transgenic or 
knock-out mice. 

Additionally, drug screening assays can also be based upon ABCl functions 
deduced upon antisense interference with the gene fimction. Intracellular localization 

15 of ABC 1 , or effects which occur upon a change in intracellular localization of the 
protein, can also be used as an assay for drug screening. Imniunocytochemical 
methods will be used to determine the exact location of the ABCl protein. 

Hxmian and rodent ABCl protein can be used as an antigen to raise antibodies, 
including monoclonal antibodies. Such antibodies will be useful for a wide variety of 

20 purposes, including but not limited to functional studies and the development of drug 
screening assays and diagnostics. Monitoring the influence of agents (e.g., drugs, 
compounds) on the expression or biological activity of ABC l can be applied not only 
in basic drug screening, but also in clinical trials. For example, the effectiveness of an 
agent determined by a screening assay as described herein to increase ABCl gene 

25 expression, protein levels, or biological activity can be monitored in clinical trails of 
subjects exhibiting altered ABCl gene expression, protein levels, or biological 
activity. Alternatively, the effectiveness of an agent determined by a screening assay 
to modulate ABCl gene expression, protein levels, or biological activity can be 
monitored in clinical trails of subjects exhibiting decreased altered gene expression, 

30 protein levels, or biological activity. In such clinical trials, the expression or activity 
of ABCl and, preferably, other genes that have been implicated in, for example, 
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cardiovascular disease can be used to ascertain the effectiveness of a particular drug. 

For example, and not by way of limitation, genes, including ABCl, that are 
modulated in cells by treatment with an agent (e.g., compomid, drug or small 
molecule) that modulates ABCl biological activity (e.g., identified in a screening 
5 assay as described herein) can be identified. Thus, to study the effect of agents on 
cholesterol levels or cardiovascular disease, for example, in a clinical trial, cells can 
be isolated and RNA prepared and analyzed for the levels of expression of ABCl and 
other genes implicated in the disorder. The levels of gene expression can be 
quantified by Northern blot analysis or RT-PCR, or, alternatively, by measuring the 

10 amoimt of protein produced, by one of a number of methods known in the art, or by 
measuring the levels of biological activity of ABCl or other genes. In this way, the 
gene expression can serve as a marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may be deteimined before, and at 
various points during, treatment of the individual with the agent. 

15 In a preferred embodiment, the present invention provides a method for 

monitoring the effectiv^ess of treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomiraetic, protein, pqptide, nucleic acid, small molecule, or other 
drug candidate identified by the screening assays described herein) including the steps 
of (i) obtaining a pre-administration sample from a subject prior to administration of 

20 the agent; (ii) detecting the level of expression of an ABCl protein, mRNA, or 
genomic DNA in the preadministration sample; (iii) obtaining one or more 
post-administration samples from the subject; (iv) detecting the level of expression or 
activity of the ABCl protein, mRNA, or genomic DNA in the post-administration 
samples; (v) comparing the level of expression or activity of the ABCl protein, 

25 mRNA, or genomic DNA in the pre-administration sample with the ABCl protein, 
mRNA, or genomic DNA in the post administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 
increased administration of the agent may be desirable to increase the expression or 
activity of ABCl to higher levels than detected, i.e., to increase the effectiveness of 

30 the agent. Alternatively, decreased administration of the agent may be desirable to 
decrease expression or activity of ABCl to lower levels than detected. 
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The ABCI gene or a fragment thereof can be used as a tool to express the 
protein in an appropriate cell in vitro or in vivo (gene therapy), or can be cloned into 
expression vectors which can be used to produce large enough amounts of ABCI 
protein to use in in vitro assays for drug screening. Expression systems which may be 
5 employed include baculovirus, herpes virus, adenovirus, adeno-associated virus, 

bacterial systems, and eucaryotic systems such as CHO cells. Naked DNA and DNA- 
liposome complexes can also be used. 

Assays of ABCI activity includes binding to intracellular interacting proteins; 
interaction with a protein that up-regulates ABCI activity; interaction with HDL 

10 particles or constituents; interaction with other proteins which facilitate interaction 
with HDL or its constituents; and measurement of cholesterol efflux. Furthermore, 
assays may be based upon the molecular dynamics of macromoiecules, metabolites 
and ions by means of fluorescent-protein biosensors. Alternatively, the effect of 

candidate modulators on expression or activity may be measured at the level of ABCI 

15 protein production using the same general qf>proach in combination with standard 

immunological detection techniques, such as Western blotting or immunoprecipitation 
with an ABCI -specific antibody. Again, useful cholesterol-regulating or anti-CVD 
therapeutic modulators are identified as those which produce an change in ABCI 
polypeptide production. Agonists may also affect ABCI activity without any effect 

20 on expression level. 

Candidate modulators may be purified (or substantially purified) molecules or 
may be one component of a mixture of compounds (e,g,, an extract or supernatant 
obtained from cells). In a mixed compound assay, ABCI expression is tested against 
progressively smaller subsets of the candidate compoimd pool (e,g,, produced by 

25 standard purification techniques, e.g,, HPLC or FPLC; Ausubel et al.) until a single 
compound or minimal compound mixture is demonstrated to modulate ABC 1 
expression. 

Agonists, antagonists, or mimetics found to be effective at modulating the 
level of cellular ABCI expression or activity may be confirmed as useful in animal 
30 models (for example, mice, pigs, rabbits, or chickens). For example, the compound 
may ameliorate the low HDL levels of mouse or chicken hypoalphalipoproteinemias. 



-50- 



wo 00/55318 



PCT/IBOO/00532 



A compotind that promotes an increase in ABCl expression or activity is 
considered particularly useful in the invention; such a molecule may be used, for 
example, as a therapeutic to increase the level or activity of native, cellular ABCl and 
thereby treat a low HDL condition in an animal (for example, a human). 
5 One method for increasing ABC biological activity is to increase the 

stabilization of the ABC protein or to prevent its degradation. Thus, it would be 
useful to identify mutations in an ABC polypeptide (e.g., ABCl) that lead to 
increased protein stability. These mutations can be incorporated into any protein 
therapy or gene therapy undertaken for the treatment of low HDL-C or any other 

10 condition resulting from loss of ABCl biological activity. Similarly, compounds that 
increase the stability of a wild-type ABC polypeptide or decrease its catabolism may 
also be useful for the treatment of low HDL-C or any other condition resulting from 
loss of ABCl biological activity. Such mutations and compounds can be identified 
using the methods described herein. 

15 In one example, cells expressing an ABC polypeptide having a mutation are 

transiently metabolically labeled during translation and the half-life of the ABC 
polypeptide is determined using standard techniques. Mutations that increase the 
half-life of an ABC polypeptide are ones that increase ABC protein stability. These 
mutations can then be assessed for ABC biological activity. They can also be used to 

20 identify proteins that affect the stability of ABC 1 mRNA or protein. One can then 

assay for compounds that act on these factors or on the ability of these factors to bind 
ABCl. 

In another example, cells expressing wild-type ABC polypeptide are 
transiently metabolically labeled during translation, contacted with a candidate 
25 compounds, and the half-life of the ABC polypeptide is determined using standard 
techniques. Compounds that increase the half-life of an ABC polypeptide are useful 
compounds in the present invention. 

If desired, treatment with an agonist of the invention may be combined with 
any other HDL-raising or anti-CVD therapies. 
30 Itisunderstoodthat, while ABCl is the preferred ABC transporter for the 

drug screens described herein, other ABC transporters can also be used. The 
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replacement of ABC 1 with another ABC transporter is possible because it is likely 
that ABC transporter family members, such as ABC2, ABCR, or ABC8 will have a 
similar mechanism of regulation. 

Exemplary assays are described in greater detail below. 

5 

Protein-based assays 

ABCl polypeptide (purified or unpurified) can be used in an assay to 
determine its ability to bind another protein (including, but not limited to, proteins 
foxmd to specifically interact with ABCl). The effect of a compound on that binding 
10 is then detennined. 



Protein Interaction Assays 

ABCl protein (or a polypeptide jfragment thereof or an epitope-tagged form or 
fragment thereof) is harvested from a suitable source (e.g., from a prokaryotic 

15 expression system, eukaryotic cells, a cell- free system, or by immunoprecipitation 
from ABCl -expressing cells). The ABCl polypeptide is then bound to a suitable 
support (e.g., nitrocellulose or an antibody or a metal agarose colimm in the case of, 
for example, a his-tagged form of ABCl). Binding to the support is preferably done 
imder conditions that allow proteins associated with ABCl polypeptide to remain 

20 associated with it. Such conditions may include use of buffers that minimize 

interference with protein-protein interactions. The binding step can be done in the 
presence and absence of compounds being tested tor their ability to interfere with 
interactions between ABCl and other molecules. If desired, other proteins (e.g., a cell 
lysate) are added, and allowed time to associate with the ABC polypeptide. The 

25 immobilized ABCl polypeptide is then washed to remove proteins or other cell 
constituents that may be non-specifically associated with it the polypeptide or the 
support. The immobilized ABCl polypeptide is then dissociated from its support, and 
so that proteins boimd to it are released (for example, by heating), or, alternatively, 
associated proteins are released from ABCl without releasing the ABCl polypeptide 

30 from the support. The released proteins and other cell constiments can be analyzed, 
for example, by SDS-PAGE gel electrophoresis. Western blotting and detection with 
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specific antibodies, phosphoaxnino acid analysis, protease digestion, protein 
sequencing, or isoelectric focusing. Nonnal and mutant forms of ABCl can be 
employed in these assays to gain additional infonnation about which part of ABCl a 
given factor is binding to. In addition, when incompletely purified polypeptide is 
5 employed, comparison of the nonnal and muatant fonns of the protein can be used to 
help distinguish true binding proteins. 

The foregoing assay can be performed using a pxuifLed or semipurified protein 
or other molecule that is known to mteract with ABC 1 . This assay may include the 
following steps. 

10 1, Harvest ABCl protein and couple a suitable fluorescent label to it; 

2. Label an interacting protein (or other molecule) with a second, different 
fluorescent label. Use dyes that will produce different quenching patterns when they 
are in close proximity to each other vs. when they are physically separate (i.e., dyes 
that quench each other when they are close together but fluoresce when they are not in 

15 close proximity); 

3 . Expose the interacting molecule to the immobilized ABCl in the presence 
or absence of a compound being tested for its ability to interfere with an interaction 
between the two; and 

4. Collect fluorescent readout data. 

20 

Another assay is includes Fluorescent Resonance Energy Transfer (FKET) 
assay. This assay can be performed as follows. 

I. Provide ABC 1 protein or a suitable polypeptide fragment thereof and 
couple a suitable FRET donor (e.g.,. nitro-benzoxadiazole (NBD)) to it; 
25 2. Label an interacting protein (or other molecule) with a FRET acceptor (e.g.^ 

rhodamine); 

3. Expose the acceptor-labeled interacting molecule to the donor-labeled 
ABCl in the presence or absence of a compound being tested for its ability to 
interfere with an interaction between the two; and 
30 4. Measure fluorescence resonance energy transfer. 
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Quenching and FRET assays are related. Either one can be applied in a given 
case, depending on which pair of fluorophores is used in the assay. 

Membrane permeability assay 
5 The ABCl protein can also be tested for its effects on membrane permeability. 

For example, beyond its putative ability to translocate lipids, ABCl might affect the 
permeability of membranes to ions. Other related membrane proteins, most notably 
the cystic fibrosis transmembrane conductance regulator and the sulfonylurea 
receptor, are associated with and regulate ion channels. 

10 ABCl or a fragment of ABCl is incorporated into a synthetic vesicle* or, 

alternatively, is expressed in a cell and vesicles or other cell sub-structures containing 
ABCl are isolated. The ABCl -containing vesicles or cells are loaded with a reporter 
molecule (such as a fluorescent ion indicator whose fluorescent properties change 
when it binds a particxilar ion) that can detect ions (to observe outward movement), or 

15 alternatively, the external medium is loaded with such a molecule (to observe inward 
movement). A molecule which exhibits differential properties when it is inside the 
vesicle compared to when it is outside the vesicle is preferred. For example, a 
molecule that has quenching properties when it is at high concentration but not when 
it is at another low concentration would be suitable. The movement of the charged 

20 molecule (either its ability to move or the kinetics of its movement) in the presence or 
absence of a compound being tested for its ability to affect this process can be 
determined. 

In another assay, membrane permeability is determined electro- 
physiologically by measuring ionic influx or efflux mediated by or modulated by 
25 ABCl by standard electrophysiological techniques. A suitable control (e.g., TD cells 
or a cell line with very low endogenous ABCl expression) can be used as a control in 
the assay to determine if the effect observed is specific to cells expressing ABCl . 

In still another assay, uptake of radioactive isotopes into or out of a vesicle can 
be measured. The vesicles are separated from the extravesicular medium and the 
30 radioactivity in the vesicles and in the medium is quantitated and compared. 
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Nucleic flcid-hflsed aissays 

ABCl nucleic acid may be used in an assay based on the binding of factors 
necessary for ABCl gene transcription. The association between the ABCl DNA and 
the binding factor may be assessed by means of any system that discriminates 
between protein-bound and non-protein-boimd DNA (e*g., a gel retardation assay). 
The effect of a compound on the binding of a factor to ABCl DNA is assessed by 
means of such an assay. In addition to in vitro binding assays, in vivo assays in which 
the regulatory regions of the ABCl gene are linked to reporter genes can also be 
performed. 

\Assays mea5turing MKCA stability 

A cell-based or cell-free system can be used to screen for compounds based on 
their effect on the half-life of ABCl mRNA or ABCl protein. The assay may employ 
labeled mRNA or protein. Alternatively, ABCl mRNA may be detected by means of 
specifically hybridizing probes or a quantitative PCR assay. Protein can be 
quantitated* for example, by fluorescent antibody-based methods. 

In vitro mRNA stability assay 

1 , Isolate or produce, by in vitro transcription, a suitable quantity of ABCl 

mRNA; 

2. Label the ABCl mRNA; 

3, Expose aliquots of the mRNA to a cell lysate in the presence or absence of a 
compound being tested for its ability to modulate ABCl mRNA stability; 

4. Assess intactness of the remaining mRNA at suitable time points. 

In vitro protein stability assay 

1 . Express a suitable amount of ABCl protein; 

2. Label the protein; 

3 . Expose aliquots of the labeled protein to a cell lysate in the presence or 
absence of a compound being tested for its ability to modulate ABCl protein stability; 

4. Assess intactness of the remaining protein at suitable time points 
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Jn vivo mRNA or protein stability assay 

1 . Incubate cells expressing ABC 1 mRNA or protein with a tracer 
(radiolabeled ribonucleotide or radiolabeled amino acid, respectively) for a very brief 
time period (e.g., five minutes) in the presence or absence of a compound being tested 
for its effect on mRNA or protein stability; 

2. Incubate with unlabeled ribonucleotide or amino acid; and 

3. Quantitate the ABCl mRNA or protein radioactivity at time intervals 
beginning with the start of step 2 and extending to the time when the radioactivity in 
ABCl mRNA or protein has declined by approximately 80%, It is preferable to 
separate the intact or mostly intact mRNA or protein from its radioactive breakdown 
products by a means such as gel electrophoresis in order to quantitate the mRNA or 
protein. 

Assays measuring inhibition of dominant tiegative activity 

Mutant ABCl polypeptides are likely to have dominant negative activity (i.e., 
activity that interferes with wild-type ABCl function). An assay for a compound that 
can interfere with such a mutant may be based on any method of quantitating normal 
ABCl activity in the presence of the mutant. For example, nomial ABCl facilitates 
cholesterol efflux, and a dominant negative mutant would interfere with this effect. 
The ability of a compound to counteract the effect of a dominant negative mutant may 
be based on cellular cholesterol efflux, or on any other normal activity of the wild- 
type ABCl that was inhibitable by the mutant 

Assays measuring phosphorylation 

The effect of a compound on ABCl phosphorylation can be assayed by 
methods that quantitate phosphates on proteins or that assess the phosphorylation state 
of a specific residue of a ABCl . Such methods include but are not limited to ^^P 
labeUing and immunoprecipitation, detection with antiphosphoamino acid antibodies 
(e.g., antiphosphoserine antibodies), phosphoamino acid analysis on 2-dimensionai 
TLC plates, and protease digestion fingerprinting of proteins followed by detection of 
^^-labeled jQragments. 
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Assays measuring nther post-tranRlatioTial mndifinatinng 

The effect of a compound on the post-translational modification of ABCl is 
based on any method enable of quantitating that particular modification. For 
example, effects of compoxmds on glycosylation may be assayed by treating ABCl 
5 with giycosylase and quantitating the amount and nature of carbohydrate released. 

Assays measuring ATP binding 

The ability of ABCl to bind ATP provides another assay to screen for 
compounds that affect ABCl , ATP binding can be quantitated as follows, 
2^ 10 1 . Provide ABC 1 protein at an appropriate level of purity and reconsititute it 

in a lipid vesicle; 

2. Expose the vesicle to a labeled but non-hydrolyzable ATP analog (such as 
gamma ^'S-ATP) in the presence or absence of compounds being tested for their 

25 effect on ATP binding. Note that azido-ATP analogs can be used to allow covalent 

1 5 attachment of the azido-ATP to protein (by means of U*V. light), and permit easier 
quantitation of the amount of ATP bound to the protein* 

3. Quantitate the amount of ATP analog associated with ABCl 

30 

Assays measuring ATPase activity 
20 Quantitation of the ATPase activity of ABCl can also be assayed for the effect 

35 of compounds on ABCl . This is preferably pertbrmed in a cell-free assay so as to 

separate ABC 1 from the many other ATPases in the cell. An ATPase assay may be 
performed in the presence or absence of membranes^ and with or without integration 
of ABCl protein into a membrane* If performed in a vesicle-based assay, the ATP 

40 

25 hydrolysis products produced or the ATP hydrolyzed may be measured within or 

outside of the vesicles, or both. Such an assay may be based on disappearance of 

ATP or appearance of ATP hydrolysis products. 
4Q For high-throughput screening, a coupled ATPase assay is preferable. For 

example, a reaction mixture containing pyruvate kinase and lactate dehydrogenase can 
30 be used. The mixture includes phosphoenolpyruvate (PEP), nicotinamide adenine 

dinucleotide (NAD+), and ATP. The ATPase activity of ABCl generates ADP from 
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ATP. The ADP is then converted back to ATP as part of the pyruvate kinase reaction. 
The product, pyruvate, is then converted to lactate. The latter reaction generates a 
colored quinone (NADH) from a colorless substrate (NAD-*-), and the entire reaction 
can be monitored by detection of the color change upon formation of NADH. Since 
5 ADP is limiting for the pyruvate kinase reaction, this coupled system precisely 
monitors the ATPase activity of ABCl . 

Afigays mftflRiintigr t^holesterol efflnx 

A transport-based assay can be performed in vivo or in vitro. For example* the 

10 assay may be based on any part of the reverse cholesterol transport process that is 

readily re-created in culture, such as cholesterol or phospholipid efflux. Alternatively, 
the assay may be based on net cholesterol transport in a whole orgaiaism, as assessed 
by means of a labeled substance (such as cholesterol). 

For high throughput, fluorescent hpids can be used to measure 

15 ABCl -catalyzed lipid efflux. For phospholipids, a fluorescent precursor, 

C6-NBD-phosphatidic acid, can be used. This lipid is taken up by cells and 
dephosphorylated by phosphatidic acid phosphohydrolase. The product, 
NBD-diglyceride, is then a precursor for synthesis of glycerophospholipids like 
phosphatidylcholine. The efflux of NBD-phosphatidylcholine can be monitored by 

20 detecting fluorescence resonance energy transfer (FRET) of the NBD to a suitable 
acceptor in the cell culture medium. This acceptor can be rhodamine-labeled 
phosphatidylethanolamine, a phospholipid that is not readily taken up by cells. The 
use of short-chain precursors obviates the requirement for the phospholipid transfer 
protein in the media. For cholesterol NBD-cholesterol ester can be reconstituted into 

25 LDL, The LDL can efficiently deliver this lipid to cells via the LDL receptor 

pathway. The NBD-cholesterol esters are hydrolyzed in the lysosomes, resulting in 
NBD-cholesterol that can now be transported back to the plasma membrane and 
efflux from the cell. The efflux can be monitored by the aforementioned FRET assay 
in which NBD transfers its fluorescence resonance energy to the 

30 rhodamine-phosphatidylethanoline acceptor. 
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Animal Modfil Systems 

Compounds identified as having activity in any of the above-described assays 
are subsequently screened in any available animal model system, including, but not 
limited to, pigs, rabbits, and WHAM chickens. Test compounds are administered to 
5 these animals according to standard methods. Test compounds may also be tested in 
mice bearing mutations in the ABCl gene. Additionally, compounds may be screened 
for their ability to enhance an interaction between ABCl and any HDL particle 
constituent such as ApoAI, ApoAII, or ApoE. 

10 Thft rhnlesterol efflux assay as a drug screen 

The cholesterol efflux assay measures the ability of cells to transfer cholesterol 
to an extracellular acceptor molecule and is dependent on ABCl function. In this 
procedure, cells are loaded with radiolabeled cholesterol by any of several 
biochemical pathways (Marcil et aL, Arterioscler. Thromb. Vase. Biol. 19:159-169, 

15 1999). Cholesterol efflux is then measured after incubation for various times 

{typically 0 to 24 hours) in the presence of HDL3 or purified ApoAI. Cholesterol 
efflux is determined as the percentage of total cholesterol in the culture medium after 
various times of incubation. ABCl expression levels and/or biological activity are 
associated with increased efflux while decreased levels of ABCl are associated with 

20 decreased cholesterol efflux. 

This assay can be readily adapted to the format used for drug screening, which 
may consist of a multi-well (e.g., 96-well) format. Modification of the assay to 
optimize it for drag screening would include scaling down and streamlining the 
procedure, modifying the labeling method, using a different cholesterol acceptor, 

25 altering the incubation time, and changing the method of calculating cholesterol 

efflux. In all these cases, the cholesterol efflux assay remains conceptually the same, 
though experimental modifications may be made. A transgenic mouse overexpressing 
ABCl would be expected to have higher than normal HDL levels. 

30 Knock-out mfiuse..Tnfidel 

An animal, such as a mouse, that has had one or both ABCl alleles inactivated 
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by homologous recombination) is likely to have low KDDL-C levels, and ihus is a 
preferred animal model for screening for compounds that raise HDL-C levels. Such 
an animal can be produced using standard techniques. h\ addition to the initial 
screening of test compounds, the animals having mutant ABCl g^es are useful for 
5 further testing of efficacy and safety of dmgs or agents first identified using one of the 
other screening methods described herein. Cells taken from the animal and placed in 
culture can also be exposed to test compounds. HDL-C levels can be measured using 
standard techniques, such as those described herein. 

10 WHAM chickens: an animal model for low HDL cholesterol 

Wisconsin Hypo- Alpha Mutant (WHAM) chickens arose by spontaneous 
mutation in a closed flock. Mutant chickens came to attention through their a Z- 
linked white shank and white beak phenotype referred to as 'recessive white skin' 
(McGibbon, 1981) and were subsequently found to have a profound deficiency of 

15 HDL (Poemamaer^i/., 1990). 

This chicken low HDL locus (Y) is Z-linked, or sex-linked. (In birds, females 
are ZW and males are ZZ), Genetic mapping placed the Y locus on the long arm of 
the Z chromosome (Bitgood, 1985), pro?dmal to the ID locus (Bitgood, 1988). 
Examination of current public mapping data for the chicken genome mapping project, 

20 ChickMap (maintained by the Roslin Institute; http://www.ri.bbsrc.ac.iik/ 

chickmap/ChickMapHomePage.html) showed that a region of synteny with human 
chromosome 9 lies on the long arm of the chicken Z chromosome (Zq) proximal to 
the ID locus. Evidence for this region of synteny is the location of the chicken 
aldolase B locus (ALDOB) within this region. The human ALDOB locus maps to 

25 chromosome 9q22.3 (The Genome Database, http://gdbwww.gdb.org/), not far from 
the location of human ABCL This comparison of maps showed that the chicken Zq 
region near chicken ALDOB and the human 9q region near human ALDOB represent 
a region of synteny between human and chicken. 

Since a low HDL locus maps to the 9q location in humans and to the Zq 

30 region in chickens, these low HDL loci are most probably located within the syntenic 
region. Thus we predicted that ABCl is mutated in WHAM chickens. In support of 
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this, we have identified an E«^K mutation at a position that corresponds to amino acid 
89 of human ABCl (Figs. 14 and 15). This non-conservative substitution is at a 
position that is conserved among human, mouse, and chicken, indicating that it is in a 
region of the protein likely to be of functional importance. 
5 Discovery of the WHAM mutation in the amino-tenninal portion of the ABCl 

protein also establishes the importance of the amino-terminal region. This region may 
be critical because of association with other proteins required to carry out cholesterol 
efflux or related tasks. It may be an important regulatory region (there is a 
phosphorylation site for casein kinase near the mutated residue), or it may help to 
10 dictate a precise topological relationship with cellular membranes (the N-terminal 60 
amino acid region contains a putative membrane-spanning or membrane-associated 
segment). 

The amino-terminal region of the protein (up to the first 6-TM region at 
approximately amino acid 639) is an ideal tool for screening factors that affect ABCl 
15 activity. It can be expressed as a truncated protein in ABCl wild type cells in order to 
test for interference of the normal ABCl function by the truncated protein. If the 
fragment acts in a dominant negative way, it could be used in immunoprecipitations to 
identify proteins that it may be competing away from the normal endogenous protein. 
The C-terminus also lends itself to such experiments, as do the intracellular 
20 portions of the molecule, expressed as fragments or tagged or fusion proteins, in the 
absence of transmembrane regions. 

Since it is possible that there are several genes in the human genome which 
affect cholesterol efflux, it is important to establish that any animal model to be used 
for a human genetic disease represents the homologous locus in that animal, and not a 
25 different locus with a similar function. The evidence above establishes that the 
chicken Y locus and the human chromosome 9 low HDL locus are homologous. 
WHAM chickens are therefore an important animal model for the identification of 
drugs that modulate cholesterol efflux. 

The WHAM chickens' HDL deficiency syndrome is not, however, associated 
30 with an increased susceptibility to atherosclerosis in chickens. This probably reflects 
the shorter lifespan of the chicken rather than an inherent difference in the function of 
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the chicken ABCl gene compared to the human gene. We propose the WHAM 
chicken as a model for human low HDL for the development and testing of drugs to 
raise HDL in humans* Such a model could be employed in several forms, through the 
use of cells or other derivatives of these chickens, or by the use of the chickens 
themselves in tests of drug effectiveness, toxicity, and other drug development 
purposes. 

Therapy 

Compounds of the invention, including but not linfiited to, ABCl polypeptides, 
ABCI nucleic acids, other ABC transporters, and any therapeutic agent that modulates 
biological activity or expression of ABCl identified using any of the methods 
disclosed herein, may be administered with a pharmaceutically-acceptable diluent, 
carrier, or excipient, in unit dosage form. Conventional pharmaceutical practice may 
be employed to provide suitable formulations or compositions to administer such 
compositions to patients. Although intravenous administration is preferred, any 
appropriate route of administration may be employed, for example, percnteral, 
subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, intraventricular, 
intracapsular, intraspinal, intracistemal, intraperitoneal, intranasal, aerosol, or oral 
administration* Therapeutic formulations may be in the form of liquid solutions or 
suspension; for oral administration, formulations may be in the form of tablets or 
capsules; and for intranasal formulations, in the form of powders, nasal drops, or 
aerosols. 

Methods well known in the art for making formulations are found in, for 
example, Remin gton: The Science and Practice of Pharmacy , (19th ed.) ed. A.R. 
Geimaro AR., 1995, Mack Publishing Company, Easton, PA. Formulations for 
parenteral administration may, for example, contain excipients, sterile water, or saline, 
polyalkylene glycols such as polyethylene glycol, oils of vegetable origin, or 
hydrogenated napthalenes. Biocompatible, biodegradable lactide polymer, 
lactide/glycolide copolymer, or polyoxyethylene-poiyoxypropylene copolymers may 
be used to control the release of the compounds* Other potentially useful parenteral 
delivery systems for agonists of the invention include ethylenevinyl acetate 
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copolymer particles, osmotic pumps, implantable infiision systems, and liposomes. 
Formulations for inhalation may contain excipients, or example, lactose, or may be 
aqueous solutions containing, for example, polyoxyethylene-9-lauxyl ether, 
glycocholate and deoxycholate, or may be oily solutions for administration in the 
5 form of nasal drops, or as a gel. 

Compounds 

In general, novel drugs for the treatment of aberrant cholesterol levels and/or 
CVD are identified from large libraries of both natural product or synthetic (or semi- 

10 synthetic) extracts or chemical libraries according to methods known in the art. Those 
skilled in the field or drug discovery and development will understand that the precise 
source of test extracts or compounds is not critical to the screening procedure(s) of the 
invention. Accordingly, virtually any niunber of chemical extracts or compounds can 
be screened using the exemplary methods described herein. Examples of such 

15 extracts or compounds include, but are not limited to, plant-, fungal-, prokaryotic- or 
animal-based extracts, fermentation broths, and synthetic compoimds, as well as 
modification of existing compounds. Numerous methods are also available for 
generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of any 
number of chemical compoimds, including, but not limited to, saccharide-, lipid-, 

20 peptide-, and nucleic acid-based compounds. Synthetic compound libraries are 
commercially available from Brandon Associates (Merrimack, NH) and Aldrich 
Chemical (Milwaukee, WI). Alternatively, libraries of natural compounds in the form 
of bacterial, fungal, plant, and animal extracts are commercially available from a 
number of sources, including Biotics (Sussex, UK), Xenova (Slough, UK), Harbor 

25 Branch Oceangraphics Institute (Ft. Pierce, FL), and PharmaMar, U.S.A. (Cambridge, 
MA), In addition, natural and synthetically produced libraries are produced, if 
desired, according to methods known in the art, e,g., by standard extraction and 
fractionation methods. Furthermore, if desired, any library or compoimd is readily 
modified using standard chemical, physical, or biochemical methods. 

30 In addition, those skilled in the art of drug discovery and development readily 

understand that methods for dereplication (e.g., taxonomic dereplication, biological 
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dereplication, and chemical dereplication, or any combination thereoQ or the 
elimination of replicates or repeats of materials already known for their HDL-raising 
and anti-CVD activities should be employed whenever possible. 

When a crude extract is found to have cholesterol-modulating or anti-CVD 
activities or both, fimher fractionation of the positive lead extract is necessary to 
isolate chemical constituent responsible for the observed effect. Thus, the goal of the 
extraction, fractionation, and purification process is the careful characterization and 
identification of a chemical entity within the crude extract having cholesterol- 
modulating or anti-CVD activities. The same in vivo and in vitro assays described 
herein for the detection of activities in mixtures of compounds can be used to purify 
the active component and to test derivatives thereof. Methods of fi:actionation and 
purification of such heterogeneous extracts are known in the art. If desired, 
compounds shown to be useful agents for the treatment of pathogenicity are 
chemically modified according to methods known in the art. Compounds identified 
as being of therapeutic value are subsequently analyzed using any standard animal 
model of diabetes or obesity known in the art. 

It is understood that compounds that modulate activity of proteins that 
modulate or are modulated by ABCl are useful compounds for modulating 
cholesterol levels. Exemplary compounds are provided herein; others are known in 
the art. 

* Compounds that are structurally related to cholesterol, or that mimic ApoAl or 
a related apolipoprotein, and increase ABCl biological activity are particularly useful 
compounds in the invention. Other compounds, known to act on the MDR protein, 
can also be used or derivatized and assayed for their abihty to increase ABCl 
biological activity. Exemplary MDR modulators are PSC833, bromocriptine, and 
cyclosporin A. 

Screening patients having low HDTrC 

ABCl expression, biological activity, and mutational analysis can each serve 
as a diagnostic tool for low HDL; thus determination of the genetic subtyping of the 
ABCl gene sequence can be used to subtype low HDL individuals or families to 
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detennine whether the low HDL phenotype is related to ABCl function. This 
diagnostic process can lead to the tailoring of drug treatments according to patient 
genotype, including prediction of side effects upon administration of HDL increasing 
drugs (referred to herein as pharmacogenomics). Pharmacogenomics allows for the 
selection of agents (e.g., drugs) for therapeutic or prophylactic treatment of an 
individual based on the genotype of the individual (e.g., the genotype of the individual 
is examined to determine the ability of the individual to respond to a particular agent). 

Agents, or modulators which have a stimulatory or inhibitory effect on ABCl 
biological activity or gene expression can be administered to individuals to treat 
disorders (e.g., cardiovascular disease or low HDL cholesterol) associated with 
aberrant ABCl activity- In conjxmction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an individual's genotype and that 
individual's response to a foreign compound or drug) of the individual may be 
considered. Differences in efficacy of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of 
the pharmacologically active drug. Thus, the pharmacogenomics of the individual 
permits the selection of effective agents (e.g., drugs) for prophylactic or therapeutic 
treatments based on a consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine appropriate dosages and 
therapeutic regimens. Accordingly, the activity of ABCl protein, expression of 
ABCl nucleic acid, or mutation content of ABCl genes in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 
persons (Eichelbaum, M., Clin. Exp. Phannacol. Physiol., 23:983-985, 1996; Linder, 
M. W., Clin. Chem., 43:254-266, 1997). In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body acts on drags (altered drug 
metabolism). Altered drug action may occur in a patient having a polymorphism 
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(e.g., an single nucleotide polymorphism or SNP) in promoter, intronic, or exonic 
sequences of ABCl. Thus by determining the presence and prevalence of 
polymorphisms allow for prediction of a patient's response to a particular then^eutic 
agent In particular^ polymorphisms in the promoter region may be critical in 
5 determining the risk of HDL deficiency and CVD. 

In addition to the mutations in the ABC2 gene described herein, we have 
detected polymorphisms in the hmrrni ABCI gene (Fig. 1 1). These polymorphisms 
are located in promoter, intronic, and exonic sequence of ABCL Using standard 
methods, such as direct sequencing, PGR, SSCP, or any other poiymorphism- 

10 detection system, one could easily ascertain whether these polymorphisms are present 
in a patient prior to the establishment of a drug treatment regimen for a patient having 
low HDL, cardiovascular disease, or any other ABCl -mediated condition. It is 
possible that some these polymorphisms are, in fact, weak mutations, individuals 
harboring such mutations may have an increased risk for cardiovascular disease; thus, 

15 these polymorphisms may also be useful in diagnostic assays. 

Association Studies of ABCl Gene Variants and HDL Levels or Cardiovascular 
Disoa&s 

The following polymorphisms have been examined for their effect on 
20 cholesterol regulation and the predisposition for the development of cardiovascular 
disease. 

Substitution of G for A at nucleotide "1045 [G('I045)A], This variant is in 
complete linkage disequilibrium with the variant at -738 in the individuals we have 
sequenced, and thus any potential phenotypic effects cunently attributed to the variant 
25 at -738 may at least in part be due to changes at this site. 

Substitution of G for A at nucleotide -738 [G(-738)A]. This variant has been 
found at very high frequencies in populations selected for low HDL cholesterol or 
premature coronary artery disease. 

Insertion of a G nucleotide at position -4 [G ins (~4)]. This variant has been 
30 associated with less coronary artery disease in its carriers than in non-carriers. 

Substitution ofaCfor Gat nucleotide -57 [G('57)CJ. This variant is in 
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complete linkage disequilibrium with the variant at -4 in the individuals we have 
sequenced, and thus the phenotypic effects cuirentiy iattributed to the variant at -4 may 
at least in part be due to changes at this site. 

Substitution of A for G at nucleotide 730 (R219K). We have found carriers to 
5 have significantly less cardiovascular disease. 

Substitution of C for Tat nucleotide 2270 (V399A), Within the French 
Canadian population, this variant has only been found in individuals fix>m the low 
HDL population. It has also been seen in individuals with low HDL or premature 
coronary artery disease in individuals of Dutch ancestry. 

10 Substitution of A for G at nucleotide 2385 (V771M). This variant has been 

found at an increased frequency in a Dutch population selected for low HDL and at an 
increased frequency in a population selected for premature coronary artery disease 
compared to a control Dutch population, indicating earners of this variant may have 
reduced HDL and an increased susceptibility to coronary artery disease. 

15 Substitution of C for A at nucleotide 2394 (T774P), This variant has been seen 

at lower fequencies in populations with coronary artery disease or low HDL than in 
individuals without. 

Substitution of C for Gat nucleotide 2402 (K776N), This variant has been 
found at a significantly lower frequency (0.56% vs. 2.91%, p=0.02) in a coronary 

20 artery disease population vs. a control population of similar Dutch background. 

Substitution of C for G at nucleotide 3590 (El 17 2D). This variant is seen at 
lower frequencies in individuals with low HDL and in some populations with 
premature coronary artery disease. 

Substitution of A for G at nucleotide 4384 (R1587K). This variant has been 

25 found at decreased frequencies in the 1/3 of individuals with the highest HDL levels 
in our large Dutch coronary artery disease population (p^O.036), at increased 
frequencies in those with HDL cholesterol <0.9 mmol/L (p<0.0001) and at decreased 
frequencies in the cohorts with HDL cholesterol >1.4 mmol/L in both this population 
(p«0.02) and the Dutch control population (p=0.003). 

30 Substitution of G for C at nucleotide 5266 (SI 73 1 C), Two FHA individuals 

who have this variant on the other allele have much lower HDL cholesterol 
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(0, 1 55+0.025) than the FHA individuals in the family who do not have this variant on 
the other allele (0.64±0.14, p=0.0009). This variant has also been found in one 
general population French Canadian control with HDL at the 8th percentile (0.92) and 
one French Canadian individual from a population selected for low HDL and 
5 coronary disease (0.72). 

Substitution of G for A at nucleotide -1113 {A('1113)Gf This variant has 
been seen at varying frequencies in populations distinguished by their HDL levels. 

Additional polymorphisms that may be associated with altered risk for 
cardiovascular disease or altered cholesterol levels are as follows: 
10 Substitution of G for A at nucleotide 2723 (I883M). This variant has been 

seen at a much higher frequency in individuals of Dutch ancestry with premature 
coronary artery disease. 

Insertion of 4 nucleotides (CCCT) at position -11 8L 

Substitution of C for A at nucleotide -479 (linkage disequilibrium with -518). 
15 Substitution of G for A at nucleotide -380, 

Other Embodiments 
All publications mentioned in this specification are herein incorporated by 
reference to the same extent as if each independent publication was specifically and 
20 individually indicated to be incorporated by reference. 

While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of further modifications. 
This application is intended to cover any variations, uses, or adaptations following, in 
general, the principles of the'invention and including such departures from the present 
25 disclosure within known or customary practice within the art to which the invention 
pertains and may be applied to the essential features hereinbefore set forth. 
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What we claim is : 

1* A substantially pure ABCl polypeptide having ABCl biological activity. 

5 2. The substantially pure ABCl polypeptide of claim 1, wherein said ABCl 

polypeptide is human ABC 1 . 

3. The substantially pure ABCl polypeptide of claim 1, wherein said 
polypeptide comprises amino acids 1 to 60 of SEQ ID NO: K 

10 

4. The substantially pure ABCl polypeptide of claim 1, wherein said 
polypeptide comprises amino acids 61 to 2261 of SBQ ID NO: L 

5. The substantially pure ABCl polypeptide of claim 1> wherein said 
15 polypeptide comprises amino acids 1 to 2261 of SEQ ID NO: 1. 

6. A substantially pure ABCl polypeptide comprising amino acids 1 to 60 of 
SEQ ID NO: 1/ 

20 7. A substantially pure ABCl polypeptide comprising amino acids 61 to 2261 

of SEQ ID NO: 1. 

8, A substantially pure ABCl polypeptide comprising amino acids 1 to 2261 
of SEQ ID NO: 1. 

25 

9. A substantially pure nucleic acid molecule that hybridizes at high 
stringency conditions to nucleotides 75 to 254 of SEQ ID NO: 2 and encodes a 
polypeptide having ABCl biological activity. 

30 1*^- A substantially pure nucleic acid molecule encoding an ABCl 

polypeptide having ABCl biological activity. 
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1 1. The substantially pure nucleic acid molecule of claim 9 or 10, wherein 
said nucleic acid molecule comprises nucleotides 75 to 254 of SEQ ED NO: 2, 

12. The substantially pure nucleic acid molecule of claim 9 or 10, wherein 
5 said nucleic acid molecule comprises nucleotides 255 to 6857 of SEQ ID NO: 2. 

1 3 . The substantially pure nucleic acid molecule of claim 9 or 1 0, wherein 
said nucleic acid molecule comprises nucleotides 75 to 6857 of SEQ ED NO: 2. 

10 14. An expression vector comprising the nucleic acid molecule of claim 9. 

15. A cell expressing the nucleic acid molecule of claim 9. 

16. A non-human mammal expressing the nucleic acid molecule of claim 9. 

15 

17. A substantially pure nucleic acid molecule comprising nucleotides 75 to 
254 of SEQ ID NO: 2. 

18. A substantially pure nucleic acid molecule comprising nucleotides 255 to 
20 6857ofSEQIDNO:2. 

19. A substantially pure nucleic acid molecule comprismg nucleotides 75 to 
6857ofSEQIDNO:2. 

25 20. A substantially pure nucleic acid molecule comprising at least thirty 

consecutive nucleotides corresponding to nucleotides 7015-7860 of SEQ ED NO: 2. 

21. The substantially pure nucleic acid molecule of claim 20, wherein said 
nucleic acid molecule comprises nucleotides 7015-7860 of SEQ ID NO: 2. 

30 
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22. A substantially pure nucleic acid molecule that hybridizes at high 
stringency to a probe comprising nucleotides 7015-7860 of SEQ ID NO: 2. 

23. A method of treating a human having low HDL cholesterol or 

5 cardiovascular disease, said method comprising administering to said human an 
ABCl polypeptide, or cholesterol-regulating fragment thereof. 



24. The method of claim 23, wherein said ABCl polypeptide has the 
sequence of SEQ ID NO: 1 . 

10 

25. The method of claim 23, wherein said ABCl polypeptide comprises a 
mutation that increases its stability. 

26. The method of claim 23, wherein said ABCl polypeptide comprises a 
15 mutation that increases its biological activity. 

27. A method of treating a human having low HDL cholesterol or 
cardiovascular disease, said method comprising administering to said human a nucleic 
acid molecule encoding an ABCl polypeptide or a cholesterol-regulating fragment 

20 thereof. 



28. The method of claim 27, wherein said ABCl polypeptide has the amino 
acid sequence of SEQ ID NO: 1. 

25 29. The method of claim 27, wherein said ABCl polypeptide comprises a 

mutation that increases its stability. 

30. The method of claim 27, wherein said ABCl polypeptide comprises a 
mutation that increases its biological activity. 

30 
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10 



15 



30 



31. The method of claim 30, wherein said biological activity is regulation of 
cholesteroL 

32. The method of claim 27, wherein said himian has low HDL cholesterol 
levels relative to normal. 



33. A method of increasing ABCl biological activity in a human, said method 
comprising administering to said human a nucleic acid molecule that hybridizes at 
high stringency conditions to nucleotides 75 to 254 of SEQ ID NO: 2 and encodes a 

10 polypeptide having ABCl biological activity. 

34. The method of claim 33, wherein said human has a disease selected jfrom 
the group consisting of Alzheimer's disease, Niemann-Pick disease, Huntington's 

2^ disease, x-linked adrenoleukodystrophy, and cancer. 



15 



35. A method of increasing ABCl biological activity in a human, said method 
comprising administering to said human a compoimd that increases ABC 1 biological 
activity. 



20 36. The method ofclaim 35, wherein said human has a disease selected from 

35 the group consisting of Alzheimer's disease, Nicmann-Pick disease, Huntington's 

disease, x-linked adrenoleukodystrophy, and cancer. 

37. A method of preventing cardiovascular disease in a human, said method 

40 

25 comprising administering to said human an expression vector comprising an ABC J 

nucleic acid molecule operably linked to a promoter, said ABCl nucleic acid molecule 
encoding an ABCl polypeptide having ABCl biological activity. 

45 



50 -72- 



55 



wo 00/55318 PCT/IBOO/00532 



38. A method of preventing or ameliorating the effects of a disease-causing 
mutation in an ABCl gene in a human> said method comprising introducing into said 
human an expression vector comprising a promoter operably linked to an 

nucleic acid molecule encoding an ABCl polypeptide having ABCl biological 
5 activity. 

39. A method of treating or preventing cardiovascular disease in an animal, 
said method comprising administering to said animal a compoimd that mimics the 
activity of wild-type ABCl . 

10 

40. The method of claim 39, wherein said animal is a human. 

41 . A method of treating or preventing cardiovascular disease in an animal, 
said method comprising administering to said animal a compound that modulates the 

15 biological activity of ABCl. 

42. The method of claim 41, wherein said animal is a human. 

43. The method of claim 41 , wherein said compound is selected from a group 
20 consisting of protein kinase A, protein kinase C, vanadate, okadaic acid, IBMXl , 

fibrates, ji-estradiol, arachidonic acid derivatives, WY-14,643, LTB4, 8(s)HETE, 
thiozoiidinedione antidiabetic drugs, 9-HODE, 13-HODE, nicotinic acid. HMG CoA 
reductase inhibitors, and compounds that increase PPAR-mediated ABCl expression. 

25 44. The method of claim 23, 27, 39, or 41 . wherein said cardiovascular 

disease is coronary artery disease, cerebrovascular disease, coronary restenosis, or 
peripheral vascular disease. 
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45. A method for determining whether a candidate compound is useful for 
modulating cholesterol levels, said method comprising the steps of: 

(a) providing a chicken comprising a mutation in an ABCl gene; 

(b) administering said candidate compound to said chicken; and 
5 (c) measuring ABCl biological activity in said chicken, 

wherein altered ABCl biological activity, relative to a WHAM chicken not contacted 
with said compound, indicates that said candidate compound modulates cholesterol 
levels. 

10 46. The method of claim 45, wherein said ABCl biological activity is 

transport of cholesterol 

47. A method for determining whether a candidate compound modulates 
ABCl biological activity, said method comprising the steps of: 

15 (a) providing a cell expressing an ABCl polypeptide comprising amino acids 

lto60ofSEQIDNO:l; 

(b) contacting said cell with said candidate compound; and 

(c) measuring ABCl biological activity of said cell, 

wherein altered ABCl biological activity, relative to a cell not contacted with said 
20 compound, indicates that said candidate compound modulates ABCl biological 
activity. 

48. A method for determining whether a candidate compound modulates 
ABCl expression, said method comprising the steps of: 

23 (a) providing a cell expressing an ABC J gene comprising nucleotides 75 to 

254ofSEQIDNO:2; 

(b) contacting said cell with said candidate compoimd; and 

(c) measuring ABCl expression of said cell, 

wherein altered ABCl expression, relative to a cell not contacted with said 
30 compound, indicates that said candidate compound modulates ABCl expression. 
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49. A method for determming whether a candidate compound modulates 
ABCl expression, said method comprising the steps of: 

(a) providing a nucleic acid molecule comprising an ABCl promoter operably 
linked to a reporter gene; 

5 (b) contacting said nticleic acid molecule with said candidate compound; and 

(c) measuring expression of said reporter gene, 
wherein altered reporter gene expression, relative to a control not contacted with said 
compound, indicates that said candidate compound modulates ABCl expression. 

10 . 50. The method of claim 49, wherein said promoter comprises 50 consecutive 

nucleotides selected from nucleotides I to 8238 of SEQ ID NO: 14. 

51. The method of claim 50, wherein said promoter comprises a binding site 
for a transcription factor selected from a group consisting of steroid response element 

15 binding proteins, peroxisomal proliferation-activated receptors, retinoid X receptors, 
and RAR-related orphan receptors. 

52. A method for determining whether a candidate compoimd modulates 
ABCl biological activity, said method comprising the steps of: 

20 (a) providing an ABCl polypeptide comprising amino acids 1 to 60 of SEQ ID 

NO: 1; 

(b) contacting said polypeptide with said candidate compound; and 

(c) measuring ABCl biological activity, 

wherein a change in ABCl biological activity, relative to a control not contacted with 
25 said compoimd, indicates that said candidate compound modulates ABC 1 biological 
activity. 
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53. A method for determining whether a candidate compomd modulates 
ABCl expression, said method comprising the steps of: 

(a) providing an ABCl polypeptide comprising amino acids 1 to 60 of SEQ ID 

NO: 1; 

5 (b) contacting said polypeptide with said candidate compoimd; and 

(c) measuring expression of said ABCl polypeptide, 
wherein a change in expression of said ABCl polypeptide, relative to a control not 
contacted with said compound, indicates that said candidate compound modulates 
ABCl expression. 

0 

54. A method for determining whether candidate compound modulates ABCl 
biological activity, said method comprising the steps of; 

(a) providing an ABCl polypeptide comprising amino acids 1 to 60 of SEQ ID 

NO: 1; 

5 (Ij) contacting said polypeptide with said candidate compound; and 

(c) measuring binding of said ABCl polypeptide to said candidate compound, 
wherein binding of said ABCl polypeptide to said compound indicates that said 
candidate compound modulates ABCl biological activity. 

0 55. A method for determining whether candidate compound modulates ABCl 

biological activity, said method comprising the steps of: 

(a) providing (i) an ABCl polypeptide comprising amino acids 1 to 60 of SEQ 
ID NO: 1, and (ii) a second polypeptide that interacts with said ABCl polypeptide; 

(b) contacting said polypeptides with said candidate compound; and 

5 (c) measuring interaction of said ABCl polypeptide with said second 

polypeptide, wherein an alteration in the intemction of said ABCl polypeptide with 
said second polypeptide indicates that said candidate compound modulates ABCl 
biological activity. 
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56. A method for detenniiung whether a candidate compoimd increases the 
stability or decreases the regulated catabolism of an ABCl polypeptide, said method 
comprising the steps of: 

(a) providing a cell comprising an ABCl polypeptide comprising amino acids 
5 1 to 60ofSEQIDNO: 1; 

(b) contacting said cell with said candidate compound; and 

(c) measuring the half-life of said ABCl polypeptide, 

wherein an increase in said half-life, relative to a control not contacted with said 
compound, indicates that said candidate compound increases the stability or 
10 decreases the regulated catabolism of an ABCl polypeptide. 

57. A method for determining whether a candidate compound modulates 
ABCl biological activity, said method comprising the steps of: 

(a) providing an ABCl polypeptide in a lipid membrane; 

(b) contacting said polypeptide with said candidate compound; and 

(c) measuring ABC 1 -mediated Upid transport across said lipid membrane, 
wherein a change in lipid transport, relative to a control not contacted with said 
compound, indicates that said candidate compoimd modulates ABCl biological 
activity. 

58. The method of claim 49, 52, 53, 54, 55, or 57, wherein said ABCl 
polypeptide is in a cell-free system. 

59. Themethodofclaim49, 52, 53, 54, 55,or57,whereinsaidABCl 
polypeptide is in a cell. 

60. The method of claim 59, wherein said cell is from a WHAM chicken. 

61. The method of claim 59, wherein said cell is in a human or in a non- 
human mammal. 
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62. The method of claim 61, wherein said animal is a WHAM chicken. 

63. The method of claim 52, wherein said biological activity is transport of 
lipid or interleukin-1. 

64. The method of claim 62, wherein said lipid is cholesterol. 

65. The method of claim 64, wherein said cholesterol is HDL-cholesterol 

66. The method of claim 52, wherein said biological activity is binding or 
hydrolysis of ATP by the ABC 1 polypeptide. 

67. A method for determining whether a patient has an increased risk for 
cardiovascular disease, said method comprising determining whether an ABC I gene 
of said patient has a mutation, wherein a mutation indicates that said patient has an 
increased risk for cardiovascular disease. 

68. A method for determining whether a patient has an increased risk for 
cardiovascular disease, said method comprising measuring ABCl biological activity 
in said patient or in a cell from said patient, wherein increased or decreased levels in 
said ABCl biological activity, relative to normal levels, indicates that said patient has 
an increased risk for cardiovascular disease. 

69. A method for determining whether a patient has an increased risk for 
cardiovascular disease, said method comprising measuring ABCl expression in said 
patient or in a cell firom said patient, wherein decreased levels in said ABCl 
expression relative to normal levels, indicates that said patient has an increased risk 
for cardiovascular disease. 
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70. The method of claim 69, wherein said ABCl expression is determined by 
measuring levels of ABCl polypeptide. 

71 . The method of claim 69, wherein said ABCl expression is determined by 
5 measuring levels of ABCl RNA. 

72. A non-human mammal comprising a transgene comprising a nucleic acid 
molecule encoding a dominant-negative ABCl polypeptide. 

10 73. A cell isolated from a non-human mammal comprising a transgene 

comprising a nucleic acid molecule encoding an ABCl polypeptide having biological 
activity. 

74. A method for determining whether a candidate compound decreases the 
15 inhibition of a dominant-negative ABCl polypeptide, said method comprising the 

steps of: 

(a) providing a cell expressing a dominant-negative ABCl polypeptide; 

(b) contacting said cell with said candidate compound; and 

(c) measuring ABCl biological activity of said cell, 

20 wherein an increase in said ABCl biological activity, relative to a cell not contacted 
with said compound, indicates that said candidate compound decreases the inhibition 
of a dominant-negative ABCl polypeptide. 

75. A method for determining whether a person has an altered risk for 

25 developing cardiovascular disease, comprising examining the person's ABCl gene for 
polymorphisms, wherein the presence of a polymorphism associated with 
cardiovascular disease indicates the person has an altered risk for developing 
cardiovascular disease. 

30 
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76. A method for predicting a person's response to a drug» comprising 
determining whether the person has a polymorphism in an ABCl gene that alters the 
person's response to said drug. 

77. A method for predicting a person's response to a drug, comprising 
determining whether the person has a polymorphism in an ABCl promoter that alters 
the person's re^onse to said drug. 

78. A method for altering ABCl expression in a cell, said method comprising 
contacting said cell with a compound selected from a group consisting of fibrates, 
P-estradiol, arachidonic acid derivatives, WY- 14,643, LTB4, 8(s)HETE, 
thiozolidinedione antidiabetic drugs, 9-HODE, 13-HODE, nicotinic acid, HMG Co A 
reductase inhibitors, and compounds that increase PPAR-mediated ABCl expression. 

79. A pharmaceutical composition comprising (i) a nucleic acid molecule that 
hybridizes under high stringency conditions to nucleotides 75 to 254 of SEQ ID NO: 
2 and encodes a polypeptide having ABCl biological activity; and (ii) a 
pharmaceutically acceptable carrier. 

80. A nucleic acid that hybridizes under high stringency conditions to 
nucleotides 1 to 8236 of SEQ ID NO: 1 4. 

81. A nucleic acid comprising a region that is 80% identical to at least thirty 
contiguous nucleotides of nucleotides 1 to 8236 of SEQ ID NO: 14. 
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82. A method for determining whether candidate compound modulates ABCl 
biological activity, said method comprising the steps of: 

(a) providing an ABC 1 polypeptide; 

(b) contacting said polypeptide with cholesterol and said candidate compound; 

5 and 

(c) measuring binding of said cholesterol to said ABCl polypeptide, wherein 
binding of said cholesterol to said ABCl polypeptide indicates that said candidate 
compound modulates ABCl biological activity, 

10 83. The method of claim 82, wherein said cholesterol is HDL cholesterol. 

84. The method of claim 82, wherein said method is performed in a cell free 

assay. 

15 85. The method of claim 82, wherein said ABCl polypeptide comprises 

amino acids 1 to 60 of SEQ ID NO: 1 . 

86. the method of claim 82, wherein said cholesterol or said ABCl 
polypeptide is detectably labeled. 
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SEQ ID NO: 1 

MACWPQLRLLIjWKNI*TFRRRQTCQLLLEVAWPLFIFLILIS\nil,SYPPYEOHECHF?NKAMPSAGTLPWVQ 

GIICNANNPCF^yPTPGEAPGWGNFNKSIVARLFSDARRLLLYSOKDTSMKDMRKA-yRTLQQIKKSSSNL 

KLQDFLVDNET?SGFLVHNLSLPKSTVDKMLRADVILHKVFL0GyQlJn-TSLCNGSKS2EMI^^^ 

LCGI/PREKLAAASaVLRSWMD ILK? I IJITLNSTSPFP SKELAEATKTLLHSLGTIACSIiPSMRSWSDM 

VMFLTMVNSSSSSTQIYQAVSHIVCGKPEGGGLKIKSLNWYEDNWVKALFGGNGTEEDAETFYDKSTTPYC 

NDLMKNLESS?liSR2IMKALKPLLVGKILYT?DTPAT?,QVMAEVNKTFQEIAVFHDLSGMWEELSPKIWTF 

MENS OEMD LVRMLLDS ROmaHFWEQQLDGLDin'AQD I \-AFIAiaiPEDVOSSNGS VVTls' AFl^ 

ISRFMECVNIA-KLEPIATBWLINKSMELLDSaKFWAGlVPTGITPGSIELPHHVKVKIRMDIDm^ERTNK 

TKDGYWDPGPKADPFEDMRYVWGGFAYLQDWEQAI I P.VLTGTEKKTGVYMQQHPYPC YVDDI FLRVMSRS 

MPLFMTlAWrVSVAVIIKGIVYEKET^LKETMRIMGLDNSILWPSWFISSLIPLLVSAGLLVVILKLGNLl, 

PYSDPSWFVFLS VPAWTI LQCFLI STLFSRANLAAACCGI I YFTIiYLPYVLCVAKODYVGFTLKIFASL 

LSPVAFGFGCSYFALFEEQGIGVQWDNl.FESPVEEDGFNLTTSVSMMLFDTFLYG^T^rX'WYIEAVFPGQYGI 

PP,PWYFPCTKSYWFGEESDEKSKPGSNQKRISEICMEEEPTHLKIiGVSIQNLVK^THDGMICVAVDGI^ 

YEGQITSFr.GP3JGACKTTTMSILTGLFPPTSGTAYII*Gia3ITlSEMSTIRQNLGVCPQHNVI,FDMLTVEEHI 

WFYARLKGIiSEKHVKAEMEQMALDVGLPSSIOjKSKTSQLSGGMQRKXjSVAIAFVGGSKVVILDEPTAGVDP 

YSRRGIWELLLKYROGRTI ILSTKHMDEADVLGDRIAI I SKGKLCCVGS SLFLKNQLGTGYYLTLVKKDVE 

SSLSSCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHSSDTLTIDVSAISNLXRKHVSSAJILVEDIGHELTYV 

lp yeaakegafvelfheiddrlsdlg issyg i settlee i flkvaeesgvdaetsdgtlparrkrrafgdk 
csclrpfteddaadpndsdidpesretdllsgmdgkgsyovkgwkltqqqfvallkkrlliarrsrkgffa 
qivlpavfvcialvfslivppfgkypslelqpwmyneqytfvsndapedtgtlehinaltkdpgfgtrcme 
gnpipdtpcqageeekttapvpqtimdlfqkgkwtmonpspacqcssdkikkmlpvcppgagglpppqrkq 
kt-adilodltgrnisdylvktyvqiiakslkmkrmvttefryggfslgvsntoalpfsqevndalxomkkhl 

KL AKD S S ADRFt^S LGRFMTGLDTRlOTn/KVmFKNKGWHAI &SFLm;iNNA2 LRANL OKGENP SH YC I TAFN 
KPLNLTKQQLSEVALMTTSVDVLVS I CVIFAMS F VPASPWFLIQERVSKAKHLQF X SGVKPVX YWLSNFV 
WDMCNYWPATLVI I IFICFQQKSYVSSTNLPVLALLLLLYGWS ITPLMYPASFVFKI PSTAYWLTSVWL 
FIGINGSVATFVLELFTDNKLmiNDILKSVFLIFPKFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 
DLVGRUrLFAMAVEGWFPLrT\'LIQVRFFIRPRPWAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKEL 
TKIYHRKRKPAVaRICVGlPPG£CFGLIK3VNGAGKSSTFKMLTGDTTVTHGDAFLNK^ISILSNIHEVHQl^^ 
GYCPQFDAITELLTGREHVEFFALLRGVPSKEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMAL 
IGGPPWFLDEPTTG^C>PKARR^LWlaCALSV\^CEGRSWLTSHSMEECEALCTRMAIMVNGRFHCLGSVQK 
LKNRFGDGYTIWRr AGSWPDLKPVQDFFGLAFPGSVl,KSKHRNMLOYQLPSSLS SLARI FS ILSOSKKRL 
H I ED YSVSQTTliDOVFVNFAKrJOSDDDHLKDLSLHKNQTWDVAVLTS FLQDEKVKES YV* 
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SEQIDNO:2 

GTCCCXGCTGTGAGCTCTGGCCGCTGCCTTCCAGGGCTCCCtSAGCCACACSCTGGGGGTG 
CTGGCTGAGGGAACATGGCTTGTTCGCCTCAGCTGAGGTTGCTGCTGTGGAAGAACCTCA 
CTTTCAGAAGAAGACAAACATGrrCAGCTGTTACTGGAAGTGGCCTOGCCTCTATTTATCT 
TCCTGATCCTGATCTCTGTTCGGCTGAGCTACCCACCCTATGA^CAACATGAATGCCATT 
TTCCAAATAAAGCCATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGGArTATCTGTA 
ATGCCAACAACCCCTGTTTCCGTTACCCGACTCCTGGGGXGGCTCCCGGAGTTGTTGGAA 
ACTTTAACAAATCCATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCTTCrTTTATACA 
GCCAGAAAGACACCAGCATGAAGGACATGCGCAAAGTTCTGACAACATTACAGCAGATCA 
AGAAATCCACCTCAAACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTG 
GGTTCCTGTATCACAACCTCTCTCTCCCAAACTCTACTGTGGACAAGATGCTGAGGGCTG 
ATGTCATTCTCCACAAGGTATTTTTGCAAGGCTACCAGTTACATTTGACAAGTCTGTGCA 
ATGGATCAAAATCAGAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTG 
CSCCTACCAAGGGAGAAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCT^CATGGACATCC 

tgaagccaatcctgagaacactaaactctacatctcccttcccgagcaagg;^gctggctg 
aagccacaaaaacattgctgcatagtcttgggactctggcccaggagctgttcagcatga 
gaagctggagtgacatgcgacaggrggtgatgtttctgaccaatgtgaacagctccagct 
cctccacccaaatctaccaggctgtgtctcgtattgtctgcgggcatcccgaqggagggg 
ggctgaagatcaagtctctcaactggtatgag6acaacaactacaaagccctctttggag 
gcaatggcactgaggaagatgctgaaaccttctatgacaactctacaactccttactgca 
atgatttgatgaagaatttgqagtctagtcctctttcccgcattatctgaaaagctctga 
agccgctgctcgttgggaagatcctgtatacacctgacactccagccacaaggcaggtca 
tggctgaggtgaacaa3accttccaggaactggctgtgttccatgatctggaaggcatgt 
gggaggaactcagcccraagatctggaccttcatggagaacagccaagaaatggaccttg 

TCCGGATGCTGTTGGACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCT 
TAGATTGGACAGCCCVi^GACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGT 
CCAGTAATGGTTCTGrGTACACCTGGAGAGA^VGCTTTCAACGAGACTAACCAGGCAATCC 
GGACCATATCTCCCT7CATGGAGTG?GTCAACCTGAACAACCTAGAACCCATAGCAACAG 
AAGTCTGGCTCATCAACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGC7GGTA 
rrGTGTTCACTGGAArrACTCCAGGCAGGATrCACCTGCCCCATCATGTCAAGTACAAGA 
TCCGAATGGACATTGACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACC 
CTGGTCCTCGAGCTGACCCCTTTGA5GACATGCGGTACGTCTGGGGGGGCTTCGCCTACT 
TGC AGGATC'TGGTGQAGC AGGCAATCAT G AGGGTGCTGACGGGCACCGAGAAGAAAACTG 



Fig. 9B 



wo 00/55318 



28 / 68 



PCTABOO/00532 



GTGTCTAT ATGCAACAGATG CCCTATCCCTG TTACGTTG ATGACATCTTTCTGCGGGTGA 

TGAGCCGGTCAATGCCCCTCTTCATGftCGCTGGCCTGGATTTACTCAGTGGCTQTGATCA 

TCAAGGGCATCGTGTATGAGAAGGAGGCACGGCTGAAAGAGACCATGCGGATCATGGGCC 

TGGAa^CAGCATCCTCTGGTTTAGCTGGTT CATT AGTAGCC7CATTC CTCTTCTTGTG A 

GCGCTGGCCTGCTACTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCA 

GCGTGGTGTTTGTCTTCCTGTCCCTGTTTGCTGTGGTQACAATCCTGCAGTGCTTCCTGA 

TTAGCACACTCTTCTCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCA 

CGCTGTACCTGCCCTACGTCCTGTGTGTGGCATGGCAGGACTACGTGGGCTTCACACTCA 

AGATGTTCGCTAGCCTGCTGTCTCCTSTaGCTTTTGGGTTTGGCTGTGAGTACTTTGCCC 

TTTTTGAGGAGCAGGGCATTGQAGT3CAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGG 

AAGATG3CTTCAATCTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATG 

GGGTGATGACCTGGTACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCT 

GGTATTTTCCTTGCACCAAGTCCTACTGGrrrGGCGAGGAAAGTGATGAGAAGAGCCACC 

CTGGXTCCAACCAGAAGAGAATATCAGAAATCTGCATGGAGGAGGAACCCACCCACTTGA 

AGCT35GCGTGTCCATTCAGAACCTGGTAAAAGTC7ACCGAGATGGGATGAAGGTGGCTG 

TCCATGSCCTGGCACTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATG 

GAGCGGGGAAGACGACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCA 

CCGCCTACATCCTGQGAAAAGACATTCGCTCTGAGATGAGCACCATCCGGCAGAACCTGG 

GGGTCTGTCCCCAGCATAACGTGCTGTTTGACATGCTGACTGTCGAAGAACACATCTGGT 

TCTATGCCCGCTTGAAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCAGATGG 

CCCTGGATGTTGGTTTGCCATCAAGCAAGCTGAAAAGCAAAACAAGCCAGCTGTCAGGTG 

GAATGCAGAGAAAGCTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTC 

tGGATGAACCCACAGCTGGTGTGGACCCTTACTCCCGCAGGGGAATATGGGAGCTGCTGC 

TGAAATACCGACAAGGCCGCACCATTATTCTCTCTACACACCACATGGATGAAGCGGACG 

TCCTGGGGGACAGGATTGCCATCATCfCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCC 

TGTTTCTGAAGAACCAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGG 

AATCCTCCCTCAGTTCCTGCAGAAACAGrAGTAGCACTGTGTCATACCTGAAAAAGGAGG 

ACAGTGTTTCTCAGAGGAGTTCTGATGCTGGCCTGGQCAGCGACCATGAGAGTGACACGC 

TGACCATCGATGTCTCTGCTATCTCC.Vi.CCTCATCAGGAAGCATGTGTCTGAAGCCCGGC 

TGGTGGAAGACATAGGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTAAGGAGG 

GAGCCTTTGTGGAACTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTA 

GTTATGGCATCTCAGAaACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTG 

GGGTGG ATGCTG AGACCTCAG ATGGTACGTTGCCAGCAAGACGAAACAGGCGGG CCTTCG 

GGGACAAGCAGAGCTGTCTTCGCCCGTTCACTGAAGATGATGCTGCTGATCCAAATGATT 
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CTGACATAGACCCAGAATCCAGAGAGACAGACTTGCrCAGrGGGATGGATC-^CAAAGGGT 

CCTACCAGGTGAAAGGCTaGAAACTTACACAGCAACAaTTTGTGGCCCrrrrGTGGAACA 

GACTGCTAATTGCCAGACGGAGTCGGAAAGGATTTTTTGCTCAGATTGTC'TTGCCAGCTC 

TGTTTGrCTGCATTGCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCA 

GCCTGGAACTTCAGCCCTGGATGTACAACGAACAGTACACATITGTCAGCAATGATGCTC 

CTGAGGACACGGGAACCCTCGAACTCTTAAACGCCCTCACCAPiAGACCCTwCTTCGGGA 

CCCGCrGTATGGAAGGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGT 

GGACCACTGCCCCAGTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAA 

TGCAGAACCCTTCACCTGCATGCCAGTGTAGCAGCGACAAAATCAAGAAGAT3CTGCCTQ 

TGTGTCCCCCAGGGGGAGGGGGGCTG CCT CCTCCACAAAGAAAACAAAACACTGCAGATA 

TCCTTCAGGACCTOACAGGAAGAAACATTTCGGATTATCTGGTGAAGACGTATGTGCAGA 

TCATAGCCAAAAGCTTAAACAACAAGATCTGGGTGAATGAGTTTAGGTATGGCGGCTTTT 

CCC7GGGTGTCAGTAATACTCAAGCACTTCCTCCGAGTCAAGAAGTTAATGATGCCATCA 

AACAAATGAAGAAACACCTAAAGCTGGCCAAGGACAGTTCTGCAGATCGATTTCTCAACA 

GCTTGGGAAGATTTATGACAGGACTGGACACCAGAAATAATGTCAAGCTGTGGTTCAATA 

ACAAGGGCTGGCATGCAATCAGCTCTTTCCTGAATGTCATCrtACAATGCCATTCTCCGGG 

CCAACCTGCAAAAGGGAGAGAACCCTAGCCATTATGGAATTACTGCTTTCAATCATCCCC 

TGAATCTCACCAAGCAGCAGCTCTCAGAGGTGGCTCTGATGACCACATCAGTGGATGTCC 

TTGTGTCCATCTGTGTCATCTTTGCAATGTCCTTCGTCCCAGCCAGCTTTGTCGTATTCC 

TGATCCAGGAGCGGGTCAGCAAAGCAAAACACCTGCAGTTCATCAGTGCACTGAAGCCTG 

TCATCTACTGGCTCTCTAATTTTGTCTGGGATATGTGCAATTACGTTGTCCCTGCCACAC 

TGGTCATTATCATCTTCATCTGCTTCCAGCAGAAGTCCTATGTGTCCTCCACCAATCTGC 

CTGTGCTAGCCCTTCrACTTTTGCTGTArGGGTCCTCAATCACACCTCTCATGTACCCAG 

CCTCCTTTGTGTTCAA3ATCCCCAGCACA3CCTATGTGGTGCTCACCAGCGTGAACCTCT 

TCATTGGCATT AATGGCAGCGTGGCCACCTTTG r G CTGGAGCTGTTCACCG ACAATAAG C 

TGAA.TAATATCAATGATATCCTGAAGTCCGTGTTCTTGATCTTCCCACATTTTTGCCTGG 

GACQAGGGCTCATCGACATGGTGAAAAACCAGGCAATGGCTGATGCCCTGGAAAGGTTTG 

GGGAGAATCGCTTTGTGTCACCATTATCTTGGGACTTGGTGGGACGAAACCTCTTCGCCA 

TGGCCGTGGAAGGGGrGGTGTTCrrCCTCATTACTGTTCTGATCCAGTACAGATTCTTCA 

TCAGGCCCAGACCTGTAAATGCAAAGCTATCTCCTCTGAATGATGAAGATGAAGATGTGA 

GGCGGGAAAGACAGAGAATTCTTCATGGTGGAGGCCAGAATGACATCTXAGAAATCAAGG 

AGTTGACGA2iGATATATAGAAGGAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCA 

TTCCTCCTGGTGAGTGCTTTGGGCTCCTGGGAGTTAATGGGGCTGGAAAATCATCAACTT 

TCAAGATGTTAACAGGACATACCACTGTrACCACAGGAGATGCTTTCCTTAACAAAAATA 
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CCATCACAGAaCTGTTGACTGGGAGAGAACACGTGGAGTTCrTTTGCCCTTTTwAGAGGAG 

TCCCAGAGAAAGAACTTCGCAAGGTTGGTGAG7GCGCGATTCGGAAACTGG3CCTCGTGA 

AGTArG 3AGAAAAATATGCTGGTAACTATAGTGGAGGCAAC AAACGCAAGCr ZTCTAC AG 

CCATGGCTTTGATCGGCGGGCCTCCTGTGGTGTTTCTGGATGAACCCACCACAGGCATGG 

ATCCCA=iAGCCCGGCGGTTCTTGTGGAATTGTGCCCTAAGTGrrGTCAAGGAGGGGAGAT 

CAGTAC-rGCTTACATCTCATAGTATGGAAGAATGTGAAGCTCTrrGCAC7A3GATGGCAA 

TCATGGrCAATGGAAQGTTCAGGTGCCTTGGCAGTGTCCAGCATCTAAAAAATAGGTTTG 

GAGATGSTTATACAATAGTTGTACGAATAGCACGGTCCAACCCGGACCTGAAGCCTGTCC 

AGGATTTCTTTGGACTTGCATTTCCTGGAAGTG'rTCTAfiJiAGAGAAACACCGSAACATGC 

TACAATACCAGCTTCCATCTTCATTATCTTCTCTGGCCAGGATATTCAGCATCCTCTCCC 

AGAGCAAAAAGCGACTCCACATAGAAGACTACTCTGTTTCTCAGACAACACTTGACCAAG 

TATTTGTGAACTTTGCCAAGGACCAAAGTGATGATGACCACTTAAAAGACCTGTCATTAC 

ACAAAAACCAGACAGTAGTGGACGTTGCAGTTCTCACATCrTTTCTACAGGATGAGAAAG 

TGAAAGAAAGCTATGTATGAAGAATCCTGTrCATACGGGGTGGCTGAAAGTAAAGAGGAA 

CTAGACTTTCCTTTGCACCATGTGAAGTGTTGTGGAGAAAAGAGCCAGAAGTTGATGTGG 

GRAOAAGTAAACTGGAXACTGTACTGATACTATTCAATGCAATGCAATTCAATGCAATGA 

AAACAAAATTCCATTACAGGGGCAGTGCCTTTGTAGCCTATGTCrTGTATGGCTCTCAAG 

TGAAAGACTTGAATTTAGTTTTTTACCTATACCTATGTGAAACTCTATTATGGAACCCAA 

TGGACATATGGGTTTGAACrCACACTTTTTTTTTTTTTTTTGTTCCTGTGTATTCTCATT 

GGGGTTGCAACAATT^rTCATCAAGTAATCATGGCCAGCGATTATTGATCAAAATCAAAA 

GGrAATGCACATCCTCATTCACTAAGCCATGCCATGCCCAGGAGACTGGTTTCCCGGTGA 

CACATCCATXGCTGGCAATGAGTGTGCCAGAGTTATTAGTGCCAAGTTTTTCAGAAAGTT 

TGAAGCACCATGGTGTGTCATGCTCACTTTTGTGAAAGCTGCTCTGCTCAGAGTCTATCA 

ACATTGAATATCAGTT5ACAGAATGGTGCCATGCGTGGCTAACATCCTGCTTTGATTCCC 

TC7GATAAGCTGTTCT3GTGGCAGTAACATGCAACAAAAATGTG0GTGTCTCCAGGCACG 

GGAAACTTGGrrCCArrGrrATATTGTCCTATGCTTCGAGCCATGGCTCTACAGGGTCAT 

CCTTATGAGACTCTTAAATATACTTAGATCCTGGTAAGAGGCAAAGAATCAACAGCCAAA 

CTGCTGGGGCTGCAACrGCTGAAGCCAGGGCATGGGATTAAAGAGATTGTGCGTTCAAAC 

CTAGGGAAGCCTGTGCCCATTTGTCCiGACTGTCTGCTAACATGGTACACTGCATCTCAA 

GATGTTTATCTGACACAAGTGTATTArrTCTGGCTTTTTGAATrAATCTAGAAAATGAAA 
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Genomic ccntig concaining ASC2 exon 1: 
Underline * putiCive prcmotcr element 



accccccatagaatgatacaattcctctgeaatgatcsgataacttcactccacccttoactttcaccttgaaggattt 

czzacccctcttggcccctcaaacttgactaccaaaacgtcgcccttaaaaataggaacacagtttcaggggcgagtac 

cagcccatgacccttccgcaaggccccctaactcaaggtagtctccrctiggaactgcggtittacggaatgtttcaggagt 

ccgaggacgca&aatttaacgctccccwagcaaggacacccttaagcacagagggcccagtagcatctggaggccagaa 

aacttaaactgaGgcagtcagattagct.tcaggctcaattaagctgaT:gggt:cascct:gggagaaattgcagcatgact 

cccaataccccctcccacccccacagcagccacgatctgtctgtctctaaccatgagtgcagtcaacctgttctttcca 

cetgtcttcgcc^tcagtaaccttgttaggcttgtccetgaacgtggctaccaacccaaagacacatgatcagagaggc 

aaCtagagaacagaccttctccaaagcaagcatgtitccgtitsggcttagaagCttcargtcctaatattatacgaccct 

gigcatctcnc^sgagatcaggcacatgagtcatacctigtigattcttgcttttgtstGaacatctcacgaacagccaat 

cagagctz^gccaccaatataccttcagttcatatcegacgcagtitaaatccacctcctgcttrgtagtctactggcaa 

cc~gtt:t::tgatataagacatct:agaacactgtaaai:at:at:aacat:ccttat:t:tgtczattatacctcaatt;acgaaaa 

agdcacccagaagcaacctcatcaagagagatactgaggccgggcacggtagctcacacttgcaatcccattactttgg 

cacgctcacgcaggtagat:cactt;gacgccaagactttgaaaccaccct:ggccaaca-gt.tgaaaccctigcczctatt.a 

aaaat;acaaaaaagC'tacccg99Cutcgcggtcggcacce:gt:aat:cccacctaccccggaggci:gaggcaggacaaT:ca 

c::tgaacctcgcaggcacaggtcgcactga9ctgagatcacaccacrccactccaacct:gggcaccagagt:gagatt:ac 

atictaaaaaacaaaataaagnaataaaaaagagacacatcgatagccgttgtnggaaatctcaacttccaticccacttc 

sggtaacztttvggaagtttgttgaacaaagtggaatacacQcacatacacacacacacatacTictctcgt^Ltgirtcaa 

ccttt:aa::caaatagci:gt:catacaatcachgt:tsccgaaagaggacaattacttgccatc:tgt:acat:crteactatgt: 

caaccawt:t:gcai:agaacccc5agaaacgcat:tcagaacaacaaacaaaatcat.aggagaaatagctaagt:g9gaaggg 

ccatataagactCgttgaaaaaattatcccttgacaaaccagccctaatgccaggcaagtcacttgct-ttgggaeaggc 

ctcagct:tct.ci:gtctataagattgcagcaggggt:9tacrgGgaatgagtcctcaacattccaagagat:£Ctatctact 

aatacgacagtcaaaeggagca Cgactccgtggaagcctctcctctrccacccacacgggccaatttctctstcccagt 

gagacgCwgacacttgtaCgacccctgcttggagacttccGtcttctggaacctgccctggcrcaggcatgacggctga 

ctgtcacccttcgatagGagcccaccactaaagctcaCgtgttggcagtgticcctccgggaaggaaaaagaccagccag 

cccatttjccactgcacaagcaaacagcttccgctagctatacagatacatscacttcctttcctcactgcstttccat 

aaacaga-ttagtgccgtagaacagtagagggcagccacgggaaggagttcccgttcttcttttggctataccaaatgo 

ggaaaaatcctcccatcttgtct:t:tttagtgtcatcct;ct:ctcccctrt:tcttcttctttacaattct.cat;ct:ct.catc 

cctcctggaaacgtgcatgtcaacttcaaaagcccacaacgtttcggtgaggaagaggtggcagaacacgtgccaggtg 

ctaactacggtcatcatttccccctccacagccagcttcctgtgaatgcgtgtgtctgtgtgtgtgcgtgtgtgtigtgt; 

ctgtgcgsgtgtctatctcctctgccagcaccactgaatctgtctgccgtctggtattccaggttttsgtttagggaaa 

agtaaaactaattttataatcccagctgtcaxi:taagccacccc:tt;:gt;gggcagcat:atggcccactctct:cagt:;t:ca 

ttgCccraaagatgc treat cacaaaggaaeaacttccaccccgc tactic tctgtcccctt act ctgccttactt-nrct 

tcgtcaetcctaccaccaccacccactgtttgaacaacccactattatttccctgttccceaticcctggtagaacagga 

gccccargaatgaagaaaccttecttctgttgttcaccaccgaatctctaacgcatsgaacacacctggcatgtgatag 

gcacccgataaatatttgttgtggctcatgggcaccttgcagagt-aacgccgcagttgtttgtagaacfcataagtgg 

taatgaatacttatctactattcctcttccaaggcgatcacacaataatcaggctttacactatccagttcttaggtct 

tccaagttatgacttgtgaggtatgttaatta wgataatagaaggcagtttatttggttcagatttattgatgtgtaat 

tcaccacagtaagacttcccczicacaaaagtatgatgagtcttgacaaacggatacacatgtatatctaccactgcca 

tgctccttttcagrctgtcgtctrcctccacccacgaccactegtcaccaccgcactgatctctgtccccttcatttcac 

cttttccagaatgtcatataaatggaatcatgcagtatgtagtttrttgtgtctgccttatttttctitagcattagcct 

tttgggattcatccaggttgtcgcacgtaacagtagcttattcctctttacggctgagtLaagtgtcccagttttattca 

tatatt-atttatgaggagctgtctcactc-gtcacccaggctggagtgccgtagcccgatctcagcticactgcaacct; 

ccgccccccacgttcaagcaat:;ctcctccGt;^:ctgagt.agctggcattacaggcacccaccgccacgccrc3act:aai::: 

ttcatartt:ttagtagagat9gggtctcaccatcttggGcaggci;cacc;;caaaCu:;tr.gacctcagctgatccgccca 

ccrccggcccccaaagtgctagnattacacccatgagccactgtgcccagccccacttttatttatccaccagttgatig 

ccctz^tcgacaactaattg'trccagcttt -ggcnattc-gtataaggctcctataaacattcacaaatacctaggat 

Gggacgactgggt.eatacaatagtactgtacaaccttagcagaaaccgc£aaacwacwttccaaaccggctctt:ccasc 

ttacaattccacagtgtattgagtcccagtg-ctccacacacatgctaQcactcttiaatatttaanTrtagtgggtacgt 

aatgatatctcattgtggtttzaatttgcar^i-ctctgcagctaatgatgagtgtctctgcetacccgggaaggtctca 

attcagcagtctgtcgtattcttgtagatatraataactccaaaataccagtggcatttgcagttaaaatttccttaaaa 

aattggccaaaggtttccaccagtcactrcTgccacgcccaaactgtatgaaacaaggctgaggcstggagancgccac 

attctggcaaggactgatccacttig9gt;gac~3atgagacccagagaccctacgcctcgggcttcagggtgaggacsgg 

cggcaagccgactgcatgecsctcctcgcc-zgggaggctgcccagcccttagctaaagccggcagctatgggaaacag 
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szggcacaagcrgctgaacacagagcccccccctcggggataatttgcccaaaticatccaaccaggcttaagaaatgac 

i;ac:racacgtccaggact.gctgccaccc"t,g3at:wCCgacaccC"azczcrsc"arccgcct:cccaaccaat" aagca 

gacacccccaaccgcttacgacaacccagcaccctitttcgcacactaacqaaaacagggatgaaggaaacacaaacgg'cc 

X it.gctcrcacrcagaacgtagaaatcCwCtt:tcccagccaagtCwtcctaccQaccacct:accaagagctctca.accc 

acgtgtcaectccagggcaggatatcaggaaaggacattgaagaagzsgagacctiaagtttgacacctacgcattagcc 

aggccagcagtcctcgaaaaagtsccrtaggacaagagaactcaccaccgaagtcccagtiggtaggagagcstgcagca 

cattictgaccctgtatacacatctccacggcaCCecttagcaggtcccgagtggcaagagagcaggctgcacccacaga 

agccaggccaggtagaccctggtgaccactiggactictacettcacctcccgaggaccrggcaaaaggtrctaagccggg 

sacaggcatgc^cacatatttcgccracctgagtaacczteggcGCtrcgtgacaaatgg^cgggagaccagccaggtg 

ccacttgccctcatctacgagcaggatcagagtggcctattcactcgcacgactgtgaagtgggaccccrtccagccag 

raaccgcaaacgtgtatcagggcagaagtgagtgcactgcatccgaaacattgacaaatccagiiacataccactgiictc 

stC-ataLctczttttt^cctCtttttugacctteccttgttcgtccactaaccccgaaaactgatgcggaaatgtccct 

t^cgcctcagciacctgagcagaaggggccggccactgccaaacccrcccctcagcacagaattgctcccagtattgat 

caccgcgrtctaagttGcgggagcaaattgtgcaggaggccaggtcactgccaaggcgggtigggaggaactggagcagg 

aagcttcccraagtgtgcccagcaaagccacgscagaacttTictactctggccctacgctacctctcagcaaccttctc 

cat:gT:gct.tccr.ggagact;ccccggagtcagaaccccCwtctt:gaaacccagacaccttact:tccaagaaaaccctgtc 

caagaaaactcatccttccctccticcticacgaacgccgcgcagacgzscgcctccccttcctttgascttttccactea 

SecttttaggggasgtgacatCctaC3ttt5ggtt:t:ggcT:ctsggt:ac!:gcaacact:aggcT:act:aaQattccaccctt:a 

rtgcttrgcccctcctacctttccagaaacccacaatggatttgccagaaataaCGgaacgtcctgtttggacaggata 

taaccat:t:tcticagc!:agaggatattgtzcgaatgaagaaagataaacQgggagaacggaact:cacattgcttt:ggcac 

ttaaatcaacccaCgt^act.gtigiitgggaaatCa-cr'CatiatitiaticcccccgaacccacagCagaacacagtrgaaca.cca 

tacaaggcaagtattgtcatcctcattttaccatgagcaaatcgatccccagagaccataaagcctitggccaggggcac 

acagttigggaagccggggctaatccatccctgggccctttctgatacttttccttz^ctaattgtccectcctcattgc 

^accttggggacctcaagagactcarccagcctctaaatcaacgaac-gattcctggagagcagcttctgcatgagaaa 

aatctacctaatcatticattCcagcgtccctiggaacgcaacc'tct.ctcctigagccacttiagaaaacaat.wtgggatgac 

aagcatccgtcrcacaatgccectccggttgccagt:gccgcgctsccagtcgtcat:ct;tt:gaacaaaccgatgcagt.gc 

cgg::t:ta3ctctt.cctcctt;tt:ggagcaagaaacct:t:ggacgcccg^grcctcci:agaagt:ttgctgagcaaacggtaa 

ggaaaacaaataggccctaacccttgaccatttcagagaatt tcttgatttatcggaccgtcaatgaatgaattggaat 

acatactggtaggctgtccccccttcccagacactgcaatcccctccaatctcctgactctitictagaagctctaateca 

agccct^gctgggtLggcagataaaagggtattgttccactiagagaccgaccttgccaCggagatctcacttggactcac 

agatttctLagtctagcgcttcgtcttgtatccatiacctcgctaccgcattcttacttccttctgctccttgttcctcat 

gcccagtgtcceaccccaccctcgcccctactcc^ictagaggccacactgattcaccgagccatttcacaagcacagcc 

aggagagcucatggctaccaagcgccaccagggccgaattttcacctgcgtgtcccccctiticcacttttcatctztctgc 

cccctccccaectttaactctaatataactacttgggactactccagcattaaataagggtaactgctggatgggtgsc 

cgggatiacacagaatgcagtatcccrcgcccacgagaagaccciiCwrgccctiagcatggcaaacagticcticcaaggagg 

cacct:ccgdcacccaacgcactagggg5gcgcc.gtgtwca9gcgc3ggtggaacaacgccagaagt:gtgcatatgcgct: 

gaccacsagagcctgtctg'ccggcttcacagtt.gatgccccgagcccgcca tagcagacctgtttctrccatgggatgct 

Stt!;tc::!:tccagagacacagcgct:agGgttctccT:cactacct:gacagccaggt:gtcggtagcattt:t:cttggtgt:tt 

actcacactcacct:aaggcacgtt:gtgcccttccagactacgaaactact:t:tattgatggt:gctt;tct:ttt:tttt:t:t:t::i 

tgagacagagtctcgctccgccgccatgctggagtgtagcggcacaaccttccctcactigcacctccgcctgccaggT:" 

cagcgatuCtcctgcctcagccEcccaagtacctsggaccacaggtgcctgccaccaCgcccagctaatttttgtattt 

t cage agagacggggtctcaccgtattggctaggatggtcccgarrccttgaccccgcgatccgcczgcctcggcc tec 

caaaaT-gf-t'ee^at-t:ataeg=rt:gacccageacacctoGc^g5t:g=:t!ceti tit tear eatittaaaacacteaer ggTia-tia 

asccac-caaaattagt.at.st:aaccaagtccagcgaat:accat:aagt:cacr.ccaccctt:gaggcaaaat:ttacaaat:cc 

tgctgactrttgtacgtaaggggacccattttcrtagaaaacagacgtaggtctctgggatitccagcatigccatttccat: 

ccrcactgttfccggccacc^gagagaggtccattttcagaaatgcacticcccattcccagatgataacatctatagaa 

ctaaaargatcaggaccatiaacacgtagcr -c::agc=t:c::;tct:c59uacaccccccga9tccctctzzgtgggt:gaacc: 

cagaggcngggagctggccactcacgaTicca-trgacaagcagtcacgaCgcagagctgtgtgttggagg 

gagggctggattagcagtcz-sattggtgtarggccttgcagcaazaaccgacggccgtitccccctcctgciitniacccr 

tcagC-aatgaccagCCacggcGTCCCTGCTGrGAGCTCTSGCCGCTGCCt-rCCAGGGCTCCCGAGCCACACGCTSGGG 

GTGCTGGCTGAGGGAACArGGCTTGTTGGCCTCAGCTGAGGTTGCTGCTGTGGAAGAACCTCACTTrCAGAAGAAGACA 

AACAgtiaagcttgggctttizsagcagcggggggttctcccattctttcttcctcgrtttgagtcgccgattggaggagg 

9*'S95agggaaggaagctgcg'i-ggt:tttcacacaagga--gatcgaacct:ggctcttatggacacagaactgt:gtiggt: 

ccggaz.atggcatgtggcztaT:ca!:agagcgcagatrtgcagccaggtagaaatagtagct:r.tgct:t:cgt:gct:accgcc 

G&ggcasgagttctgatcccraggacctigccrccgaatcgccccticagcaccccactttctcctittrgctgcagccccg 

gcaccacc-ggcxctccaaaagciscciiaatgggcccctgcatccctcgaacctgcgggtgaagtgagctagtg 

cticctiagagatcaataccggccatcrtggtgtcaaccugcaTiccrccccttcacgcctiggaggaaGacaacaactcaga 
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citgctttattrctgcacacggtcctaasceattcacttcrccagatgcgccaacaacccTirS ccaonaatrctccagacca 
tcctt:aa^9ccca9gt:gaat:ggaac^ct.tccacacag9gatgtg«gcgctgt:agacc»gagtgaaccccct:9aaact:ca 
cacgtcagctc^ttgtctctctatccctgaacacccttccttagagatcccatctctaggatccacctctctgtagtta 
ctztctaagtctcttgctcctgttctgcctttatttttttctcctgcattctaacccagtatccccacttggctgtctt 
aatgtagcttaacacgcctgtaatcaaaatgaccatctrcctgagarrcaaagggcza&aagggactccggagagaatt 
tcattcagttttcetcaaactagaataatgcttgcactgtctgtaaaagaacaaaactgtcaaagcatccttttgttca 
ctaaacctccttttt casta tagtgttacttaaatactaggaagttaaaagtaggtawaaacttcttataggctgttat 
tatacaactacatgacccatacatacccacaaattaagtgcagccaaaaccgcaaaatcaacaccactcaaatcaatac 
cttaaaigtgctgaggcagctgttgttcaactgaaaccaaactataagttgeatggcagtaaatgctatcatgctgatc 
attttgagtttggccagtctatatzatcatgtgcraatgattgaatcctccacccatcttitccacttgtatgacct'caa 
tttcatcccacctgttccatcctcacgagtttgczacaattatactgctgccaacacaatcataaacacaaatataaac 
ctgggcttrgaaatctrgtgccagaacrcggctittiaaagtaagcatztaaaaaatccatargtgtti&attagactttgt 
ctagaCGactgttgaaatgaaaacaaagtgtttaaaatrcctcttacagaacttaaaEataatcectcagcaatatgtat 
acagatCT:tcctttgagaaaaactcattgtgt.t;cagcctctcatgc=acaaatgggsaacctgaattctgaggtctGta 
ccgagagaacaeggactggaatctgcggaccccacctgtcttaacaacaattgcaaagtataacagataatatcatatc 
aaaaagagacnxinnnnacact:tacaacgagcct:ccatgc.gcgaggcaccaactgac7agscatt:att.aaci:agaettat: 
tcottczaaggccccgcgatgtactgttattcccacatgccgtagczggggaacgcgctactcagagaggtttaagtaac 
crgtctgaggciccacaccactaacaaggagcacaggtagggttcaaazccacacaacctgactttggagctiggcactct 
aactcaatgtgcccaatcgcccctcagtggtgtcactactccgcctatcctccaticrgagaatactgaagtctctgact 
ccttcct:t:gcctttctccctgcc*cccgtggtt:atecccaggrctC9gi:gttccagtcctctatgtccgtcctt.act.ct 
tatCccrtcgctacagtgtgaticcaGggctccrgccctcctCatcc'Zcgtacaggccgcccacttgctgggaaactgtc 
rccgccarggcctaticcatgt bg~et;9Cccatt:act9agtagc.g9aaagaarcat:avCat;gtt:ggcaatgaaagggggg 
ctatggctctggggtagtctiagtctgaactcctaiintt 
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Gencmic ccncig ccr.caining ABCl exon 2: 

cittCtttSCttccttcstttctttcttitttgagsccaactctcacTictsttacccaccccgcagtgcaatggagcgatc 
Z-Ggctcaccccaacctcigt:ct.cct:gccttcaaacast::cv:cctcccrcagcctccc3a=tacctgg3actacaggctc 
cscccaccatgcccagccacctttctgcattttcagtagacacgggccttcacccttctigaccaggctggtcttgaactc 
ctgacctcacsatcaacccaccccagcctcccaaactgctcggattacaggtctgacccaccacgcccggcctcataagt 
aztttctaaacscatttacagtcacgccacttaaaaggaaagttgtas^cctgcctrtgttaatatttataagtgatttt 
attcagcracaagcttacaatggcataraactctgtatcctgcttttrccacrtaatiattacacggctaatgatttctgt 
ctircat;aaacantattctgatgatgccacgatatattgttgagtacatgtaccac.aat:t:gaascatt:tccctattgcta 
-rcaat:t:aactrgt:tcccaacatt:t£ccaat:tataacigtt£caatcaatgaataactccat:gcacat:agcttt:ttgatat 
rztaagtccactttcccaggatgaatccccaggaatiagcaatcgggcaaatgggataaacatgacticccgaataGgtatt 
ctnaaca^CGCtrtcccaaaGcgctcaactgacttatacttccgtctccactaccttttaaaccagctcatttactcacc 
aaacat^-CiaaagccaCtaccatgtggtaggcttagtaagaagaasgcgaccctaasggagaagctcatatataaatag 
wCcccccggtc::accaagtgctgatacaGacacaaagcacctgggcaaatcgacatsagggact:cctggctcagctggga 
gaaaagc'catttccacagagtcacsgtt&tgttcctircgcagaaagaaaattgcttcctiticcccacccccacccccagc 
rstattgacgtataactgacaaataaaaattgtatatctttaagatatgcaatgtgaCatatacgtatatcticaactitaa 
aaaataaccracagaat:aaaaaggt:gt:tt9Ct:attaaaaaaaaag5aaa9gctgaa'gtcat:tcccaagctt.ggaaatt:t 
gagcatgcTisCctcttccogactiacttacagaaataccagcaagaccragccccatcsccggcc'Ccgagtactccacrgrc 
accatigctitcci:ccagagagcgatccacttgcctttatt:tt:tcatt.ccgctstgccgtctacgcaaactactctt9ata 
gtictatgczaacagtctirtttttgttccatgagatiaaatttatacacgctcattgtggaaaatttagaaaagacaggaa 
actat:caaaaacatcmcyttt:ttt:ttt:tctttt:ttt;ttcttt:ttt:cr.amgcagacagagtct.t:gctctgt:cgcccaggcc 
ggaccgcagtcgcgtgatctcagctcacagcaacrtccgctt.cccacgtttaagcgat.tctcctigcc'ccagccrcccaac 
tacctgggacracaggcatgcaccaccacgcccccctaacctl:gtat:c.tit:tagi:acagatggggct:t:caccatgtt:ggcc 
accctcgcctcaaactcccgaccccaggcgatccgccccccttgccctcgcaaagttctigggattataggcaggagccac 
racgccacccacacctacgctctitaccatcctagtiacatccactgtcactatctcgctgtatttccttctgcccagtGtc 
accccgatcatgcagtgccgtgatcacgcagtgatctcggcrcaccgcaacctaggccctGtgggttcgagTigatCCtcc 
rgccctagccccccgggttcaagtgactctcttgccrtggcctcccaagtagctgggattacaggcatacacccccatcc 
ccacctaatttrtgcat-ttttagtagacacagcgcttcaccaaaatttirgtatttttagtagagatggggtct caeca tg 
-Ccgccacgc'cgtctccaactcctgaccccaggcgatccgcccgcctrggccticacaaagcgattacaggcacgagccs 
csacatccatcgccaaaaagatttctcaaaagagTittaatgcaaaaccaTiaccaaacgrctttggaaataaaaaacagtt 
ttncaaa.aat:arcagaaacaaaacaacaaat:aaat:aaat:aaataaaaacacc;caaaacaa&ct:gaagcacgagcacct:ac 
cagaaagccTicaattacgatctattcatagagtgGaacatcaagtacacatcacagcacatgttttaagaticatatttita 
rgtcacgggaaatgctcccccagtacgacgctaaatgaaaaaacacaatacaaaagcatatacgctgcatagtctcaata 
Ctatagagaaaaaatattatttatgtacgcacgaaaaaagacaaaagatgccaacagagatccattgttacttcagttta 
ctagggatrgcctctgggaggtaggattaaggtcatttatatctacctttttaaacttttctgtatttttccattttcaa 
act^tcca^aaaaatacaaggacttgaagat-caagaaaaaacttctgctttggctcag-cgcagccgccacgcccg'CaaCC 
ccagcac::t:t:sggagccctaggggaoa9gatca.cctgaacccaagacttt.gacgt:tccagtgagctatgatctccggatc 
gtaccccGtggacgatggaccaagaccctgtctcaaaaaaaaaaacccctgct: ttcccttttcgtttgctttttgagacgc 
agcctctcccrgctsccccagctggagtacagtggcacaatctcaccTicaccgGaacctctigccticctgggttcaagcga 
■ tcct:cttgcct:cagcctcccaagtacctgggat;;ccatgcacccaccact:atgcccagci:act:ctt;ctgt:atttt.cagta 
cagacaggct:t:t:caccatgt:::ggccaggctggteT:cgaattcccgaccrcagct:gacccacc3gcctt:ggcetcecaaag 
tgctgggatracaggcatsagccac:;g^gcccaccccaatctcttgcct:ttt:tcaaaaaaagaagacaaaaagggattt;t 
a:iaccact:a""atcttGgcrgcgcgact:ctgaagccacaG-t:gtaagtcataat:tactccgaaacacaaggccctgtgac 
tctttteggccctttggtGZTicatc'ctgattacaacgtitgaaacacacaaacgaaaggaacgggacaggcgatagacctc 
aggcagcgtaactagtcgtct:gaaeactact:cgctcaat:t3tacr:gt:gt:ctagt:gatt:t:ccatcttgtccgt:ctgctaat. 
Ctaccgcctggtaactcacrgaggcagggtct-ectttgcagaaacctcactgttttaaccagtgtaccatgcttgttta 
gaagttcaatgacctttttaactcatcggacaagargatgaccagacctggacacatggggaaggactttgcactctictic 
tz^acagccctgagtgcacacaggtcaatatcgaaccatgcgtgaactwCcactgwCtttgagagccctGttctGCcccG 
c3t:aggcagcagcti:tgtgt:gcaaccagagg3gcaagggc::gtgrgt.at:tt:agcacagcaggt;t:gGcct:cgt:cct:ct:cct: 
ctcaacacagccaccacataccrggcactatcccaaggctgggaa^gcacacagatgggtgcctgctttcagagtgctca 
atacgc-gaggaagccagcaacagaaacaga £gatttG5cgagc::ccaggaaaatgct:acaggaggagcgtgcct;gggt:t 
actggagtagcacaggaggagggcrtictagccGaggctgacaticztagcaaacgaaattatgccacgacgaatcctgaag 
aatgaatagaagcgaaccacazaaageacgatsggaagcaccttcccttacccaagggaagacacagaggtatatccaat: 
9S^acg^taaaaggt:tggcaccccaaacagtzccgtcaaagcti:agagag:iggtgggagagactggagaagttgatt:aat; 
cagcaaataaagctgtctgcgcacntcccaga^cccagtggcatrggatatccatatcatttttaaatttacagtgttct: 
aCCtCatttCCCactcacTSTCAGCTGCTGCrtSGAAGTGGCCTGSCCTCrATTTATCTTCCrraATCCTaATCTCTGTTCG 
GCTGASCTACCCACCCTATGAACAACATGAATgtaagtiaacrgrsgatigcrccccgagacucaccaazggcagggaaaat 
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ccaggcaactaacgtggectaaattggactcttccaaagatgctgtccttgggaaacatcac3catgctttcgatcagaa 
aacctacgcttctaatttcttgataaggcacgaacccagcagactctcttcagtcctagcgaatggtgataactgcaatt 
acaacagtagacaacatccctirttacacacticitiaaatcacgaaaatdgaataaccttaccgacaattttagaaagtggcg 
attaaaaccacatttaaeacaatgccrtaacacctagtctcttccatatgcacgatsticctaatcacacattccaaatca 
cggaacacagaatttt 
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j«ncmic ccn-tig ccncaining ABCi «xcn 3 : 

aicetacaatcacagtictictctcctacggcCggccccaccgggcgcatitgacactccagaaacggccagatccaaaggan 

ca^cas- -acg-agctsgsaaatgzagc^gsgacctcacttzcacisccctagcaat ttttcccaccactaagcagctca 

gcccataccccTiacgagacccacaagctcacgagatiactgctcttccaggaaagcaccggggccagggccaccccTitaa:: 

igtgtttcttcgcctggtcccatcttccrcacaatatatagcaacagttactitact^gctgactctctaatgcacatcac 

aCatagtsaractaaacacacacacacacacacacacacacacacacccctcaagaaacattrtictgagacgtgacttcc 

cgatttcatcaaaaaagaaaagagccgcccaggcacagzsggaacticaaGctggctsgatcacTitgaggCcaecacctitg 

3aaccaccct.ggccaacacggtggaacctcgtctcT:actiaaaaaracaaaaat:::accca9gcgcggcggc9cacacct:gz 

aatcccacccactgggcaggctgaggcacgacaattgcttcaaccrgcaacgctcacgttgcagtgagcccagattigcgc 

::;atcgcaccccagcct:cggcaacagagtga9act:crgt:c::caaaaaaaa3aaaaaaaaaaaaagcataaactgaaattta 

iiatgcaatccatatgcctgtgagataatwCtgttttctctttcgcaaccccaaagagatttttctgattgacgagcaaat: 

acaccctagactttatctaagcattacgccaagcaccactgaagtataagtttcaagggcaaactcagtttttccatcta 

ccacacgaatgattttctggaatgattacaagcaggcaagaCggtctiagtgcaaaiagcaaatgtcttcggcatcagaca 

agttggggtc-gtttigtaircctigcctctgccctceaccgaggttctigaccttggccagattgttgagttttaacctaga:: 

ccctcccacvccagatcataaattttcagaaaacttctcaaattczrgtatatacrgatggtaaatgagacttwtcctta 

=atci:a=5cacttctctgctccct:tgt:tttgagat:ggtcttgctcr.9ttgcccaGactggactgcagtagtgcaatct:c= 

GCtcaccacaatgcccgcctcccaggttccagcgagcctcctgccccagcctcccaaatagccgagactacaggcatgtg 

ccaccacgtccggccaaSttttgnattrttagtagagacagggtcctiaccawCttsaccacactggtcccsaactGctgs 

cct.caggtgattt.cgcccgcctcagGctcccaaagtgctgggactacaggcacgacccaccatgcccggccataricca'Cgc 

actccccgcaaccttaccttctttrcccatcaccctccagggaccragctgcaaaagcagagttaaaagtcaaggtgaaa 

cttggagaggrgtctttgtccctaggaacaaaggactggtttgaaattctctgcaaatcttccccagttcaaaccagagt;^ 

atcaacgccttaaaaacttccctgggtcctgagagcGcattatattattcacttgCcttccttgtacacccactgcctagt 

cctgaccctactticcgttCgcaaataggacggcgcacaacgtacaaggaagggcctttgccacccctgctaagggataag 

ctgaaacaccctcaccatcacwgccctgcgctgctitttcacccatgccagtctgcctiacagtgccagtgtctcctggca:: 

tgaaaccggagaatctctitgctcctctgagtattitggttgggttacataaacctccccgaatgaagagcacccgactcag 

gcaagcggccttgtttggttttcctcgaactattaacaggaagatagogaga'ttaactgtgtaaatgtncaatiaggccag 

actcccrgcagag9gtggccacagt:gatcagat:ct.taccacatcc"tsctttggct:gtitgcctctci:ggttggagt:anag 

at:agaaaagaaagaaagaccctat:at:tgaaatgcaaagt:9ca9Caagtcctgact;t:tggact.aacttctcagcccat::::g 

catgaaaataaaaagatgaataaaacaaggttcccactttggaaggaggtggtagccgtgagatggaaggagcgttcctg 

c"sggcaacagcagagtaagcgctggggt:agaCwCact,cccacagcgcctggaaaat:cctcat,aggctcat;ti;:gt:t;gagt; 

ctttgt:cctacaccaggcaccctgcaaaaacgcttt5cctgcaaggtctcacgceat:gctcaccacagetctgtgaagcc 

aart:gE,actt:ttatcaccat*ttacagacgagaaaact:gagggcacggggtcaaCcacttggctaaagtcactgcttagc 

aagctgcacggactggatgtgaattccaattcctccgaccccaaagcctgtgaagctacttgttcttcaccacctagaGG 

tccggttcttgataactgtgaactcctttggccucacaaatagccctgagaatatgatagaagcaggagctctggccttt 

ctgtccatacctgaacaggtccttgggccaagacccccccgtccagggcctattaatcttgatcctcataagcagcatcc 

atgca*tacggccgcaaaccaaactgcgccacaccgaatcctaggaccaagcccaaat,atgticccatcatccttt;t.ggt.a 

aGaagcrcattgtaagaaacaaagagcagagcaagaggatgacctagtgcacgsggcctcatitgttttaatcagtgacaa 

aacaacaataat;a4tcaacaaaacccccgaagc = tcacagacg3C3kcacaccccaagGCtgt:gt:.gt:cci:t:caggtgccct: 

tgagcagctttgtagctggcagaggaggtgaaacugacaaatgtccggcagatgcaggagagtaccagaggggtttgaga 

tgagc-aaattccaatccaaccgcagtgtccacgaaeaggcttggattgggaccauggagatgggggutctactcccagt: 

cacgccagccgactttgcgagcgttcrccgtGagtcactttatcttactttaCtcatttttatttttttgaaatggagcc 

tcgc^crtgtcgcccaggctggagtgaaatggcgcgacc^^ggctcaccgcaacctccccctcccgagttcaagcgatcc 

tcctgcGtcagcctccagagracctgggatriacaggcgcctgccaccaagcccaccgaacttttigtatgcttagtacaga 

cagggt:titcgccatgttgsccagggcggtct:rgaacrcctgacctcaggt;gatccgcccacctt:ggccccccaaagt;gct 

gcgattacaggcgcgagccactgtgcccagcccactrcatcttaccgtacctacctccttagagtatgaaaaaatagcct 

taggccatccGcaagtcccccc^atgtctgagagctgacgctcgctgccaaagaggaactaaggatgccagggactt 

caccaggaccGctctcaccacctctccaacgcrggT^iatcataaaceccatcctacagatgatgtccaccagaczaagaat: 

ggca-groaggccaagtccccacctgagagicagtcctacccagaagagacacctctctgggacctiggggaacgggacgg 

acagacttggcatgaagcatirgtacaaacsgaacctcaaaatcgcttcacggaatcaatgtttctccctgtgtttttcta 

CCCCiCaaCCrcaacagGCCATTTtCCAAAT^AAGCCATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGG^^ 
GTAATGCCAACAACCCCTGTTTCCGTTACCCSACTCCTGGGGAGGCrrCCCGGAGTTGTtrGGAAACTTTAACAAATCCATC 

caag^atcagatcaggt:i:t:i:ct:tT;ccaaac--gtcagczaatcctt:t:t:cc::i:cct:ttcttgtcct;ctggagaatt:ti:gaa 
cggc::ggat:ttaagngaacr^gttzttgtaaatgcttgcgcgatagagtctgcagaatgagggaagegagaat:ttt:ggag 
aastEggggcatttggggtacccatcacctLcgagtatttatcatttctgtiacgttgtgaacatttcaagtcccgccTigct 
a9Cuat:ttttggaat,atacta«atgt:cgtt:aa:;ca"atcaC9cacca3acgt:gcatctgaatgccctggctctaggagcta 
sagggcagggoctggcacssagatgcatigczsgaaggaccGttgcccataagaagcttacagccaacgctaggggacttc 
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tctcctctctccatcacctcacctctctcacctccgtcactgccccaccagactacaatgtccgcaggtctttctcccct 
cagtgtgagctccctgagcaaagcaggatgctgccccttccctttgcattccttgctccttgctccagtgcctgtacata 
agtacgggcataataagtgttcccccaaatgagacattgaggactcttcaaatgcacaggaccscgatgtgagccaggacg 
cagtaaggacgatgggatgtggctcatgacaatcctgaggaagctgcagctgcggcacgcagggccacaccgccatgttc 
acggaccctagactggctttgtagcctccatgggccccttccatacac 
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5EQlDNO:J7 

Genomic contig ccncaining A3C1 exon 4: 

--ataacTicccattaatataaagataaatacaatccagaccagattcatgactactcatacattCttagtgtaCCaactc 

^.aa^tcticctttiaaacaaactaaaacantcuaatatgcccrtaacagtaccccagcccaggccactgacccrac^ 

cattcatcsacaagtttgcccctgggggcatgtgtgtgcatgcatgiigtgtgcacatgcatgacgagccHggccttg^ 

cccggtailacctgggtgtgtagaccaatggagaaaggcatrtggsgcagtgacgatgggtgggggagg^^^ 

r=aatgcaictgggtgtagggag«atgsaagtaggttagggccagcccgtggaggacctggsagccag^^ 

--cacttcccagccactcctgtaaagcascagaaccagrtggcccacccaaagcctrccgccttttcttgcaccctttac 

AGTTCTGAGAACATTACAGCACATCAAGAAATCCAGCTCAAgtaagtaaaaaccttccctgcatccgtttataactggaa 

attgacccgcaccaggcaaagagagtagcccagctgtctggggcttgttcccattacatcttccccaaggggtttttctc 

ctMqtccccaacctgtggggcccctctccaggacccattggrgaagaaactaggccagctggttgccacagacagtgat 

ctactaatcttctctgcaaagacagaagaaaagcccccagggaagaatactacagacttggccctagggacagctagggg 

-gcaaacccccgccaactgcatttciitctgaagctggccatatggttgcagtgaatcgatttatagacagagtattttctig 

;LaL=aIaagcaattacagttgtaasttgat:atggataagcgaaagttaasGa«tct:trtct:aaaaagagaa^^^ 

'cattrncccccaatcanctcaatitagtctgatgcgcatttgaacttgttgtccccaaaaagtgaaatcttitacctctga 

-ctgqtaaatatccaaccaattccttgtscgccacccaggaggtatccggggagtccgcattttctgactgaggcattgg 

^tgccacaicatcagaccagcctrccaggcagtcccctggcaaggggacaaaggcuggtgggagcagctggctgagtgca 

cccagtaacggcatgt 
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5«ncmic ccncig ccncaininc ASCI «xon 5 1 

». 

acctccccacgigatt;ctgacgcataciit:aacccceagaaccattsc = ::gccct9caitaaaca9gagattagtccctgc 
acctrgcgceaataaagctteaaatctccccaaccccaaccccgtgaaaaggcagtecggagacaggaacsaacggacta 
cCGccacaaacct:cagscgs3Stggccgacaccatrt:agaagagaaacaccgggcarggtggcccacgcctgtactgtca 
gcacttcgcGacgccaacgcaggttsgatcacaaggtcacgagtctgagaccagcctscctatcatggtgaaaccctgtic 
^^cactaaacataaaaaaaaaaaaatctgccacticarggTigatgcazBcczgraaccccacctactcgcgagcccgaggc 
accagaatcccttgaacccgggaggcgggcgtcgcagtgacctgacatcccaccacrscacticcaacctagstgacaggg 
sgagacTiccgtctcaaaacaaaaaaaaaaaaagaaaaggaaaggctcrgtgcgcgtgcatgtgtgcgtgtgtgtstgtgt: 
ctgTigtgraacagcaccaccacactctttgagttcaggaccacatGC^gagtgcgsc'caacacgctaccagaaagcaat 
arcttcatcccsctcccgatatggcgatgctcccctatcccaCtccrgwgtgtgttEagccaggcaactgttgateatca 
acattatcasaacgtttctccactgttcccattgcgcccacnttttttctttctt-cagttactcactaaacaaaaataaa 

aca C tia 1 1 1: Cticaa CacACTTGAAGCTTCAAGATTTCCTGGTSGACAATGAAACCTTCTCTCSGCTTCCTG'r ATCACAACC 
TCTCTCrCCCAAAGTCTACTGTaOACAAGATGCrGAGGGCTGATG?CATTCTCCACAAGgtaagccgat:gcCt.CCagctC 

::c£cagt:aggci:tgat:cgcaatcacgctgtgcagct:actcgaaagaaacgaat:aaaccct:t;gtcct:tgtaat:ggcggtga 
aggggagggaggtagcctgaatacaacttcacttaattttacttcccratccagccaggaattgccaaaccatccaggac 
tggaacatgcaacctggcgtcatgggccagctgctcaaaataaaattgatttictgccttatcacttggcatttgtgatga 
=ztcctcczacaagcgaracattttaagtt§agtiraaaci:taaaaaat:attcacacstctgaagcaataaccgt:ggttaa 
gggtcattcatctggaggagctctgtctaaaaaaztigagcacaggacactttagacaagggtgtacttggagactitttaa 
gaattttataaaataagggctggacgcagcggcaccgagtitgagaaczgticgcttsctttgcatitaaacaggagatcagt 
ccctgcagct:cgtggcaataasgct:t::aaatctctccaacuttacc::=tgt:gagatggcactggegaaacagaaatgaac 
=gactagtgtcacaaacct:caggt:cggat:ggacsaCTatcact:tcaaaggeccgcaarcccacgCct.acaaccccagcac!; 
tcgggaggccaaggcgggaaaaccacctgaggtcaggagttcgagaccatcctggccaacaatgtraaagcctgtctctac 
taaaaatatgaaaacnagctcagcgtggtggcatgcticcTigtagtccGagccacccgtgaggctigagacaggagaatcg:: 
tt:gaacctcecaggccgaggctccagtgagccaatatcacgccattgcactccagcct:ggctgacagagtgagactccat 
ctcaaaaaaaaaaaaaaaaaaagaatcttataaaaccagcaaataacantiagtgtttatgCcgaattttaactttagaat: 
eatagaaaactccctctggcatcattattagacaactGtcgtgcag-gggtaccacGagacccagctitgcatggttatwC 
atttttcagacacacnttttgagcttiatcctctggqagaaaggggaaccacctcctcccctatctcgtigcctgcatacca 
ccttgncTinJiacaagaagcaaaactagtggaaacgtttattcttgaaaataaccrttttgctccacatgaeccagaatt:- 
;:taaaai tagaaaaa tgtgct caccgcg 
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SEQ ID NO: 19 
Genomic contig coneaining ABCl exon 6 : 

agtaaaatscagaattccaaattctgaaattgttagaacacagttccgtgtcttacttaaatatcgacacttacagataa 
acagcataaatgccttctccccacacttcagcccagtcctacttaaagacaacataaattgcaaaatagtgaggatgttg 
tncatctaataaaagtgcttccaggaattcagactctggattcctgttitgccaaatcatgtgtcccactcttaagaaaac 
eagttgcactntggattcttctttgcaagagggacaagagngtggoagatactgacctaatgcaacttgcaggttttaag 
rgtcctgtcattgtgccttgtgctttgatacattctgactttcactaaagagacctgacgcactggactgttgcaatgga 
acctgtctt:aagat.cttcaaa9c:tgtattigatatcaagtcctccaaaagactccaaggacccagcti:ccaaccc:t:cataa 
rcctct cgtgct tgtctc tct t tgca tgaaatgc^ticcagGTATTTTTGCAACCJCTACCAGTTACATTTSXcAAeTCTGT 
GCAATGGATCXAAATGASAAGAQATGATTCMCTTG;QTGACCAAGAA6TTTCrGA6CTTTGTGGCCTACCAAGG& 

CTGGCTGCAGCAGAGCGAGTACTTCGTTCCAACATGGACATCCTGAAGCCAATCCTGg tgagc agact t ge t cac tggag 
aaacttcaaccactaatgctctcggaatgtigaggcctttccttsgacagcatgactttigttttgtagaaaagtacggctg 
cctgsgagttcgtgatataatttagttcagtggtattctaagtgttcttagtigtncttccagacttttgggccatetccc 
aaagggcgaacgggaagaataagcCgggtgtggcrgagtttaagccaaaagccttCtgtgcttgtttcaatcagagaaga 
cckgccccticcatgtttttactatcataatactaagcaagagctca&ttigaaaacagagttcttcatattitaaaaaaaaa 
aagtcGtcaaaccattgacgggaagatggatatctattcatgtctaaaaacccatcataaagatgacattigtgggctgtc 
acagttggaaggcGctcgaat:tagat:gagaccacactatttagctt:act:t;agtaataacattg 
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jenomic excn containing AfiCl exon : and 8: 

-catttaccaaacgctcagtaaaagaaaagggtcaaaacgggagaaacgcarzticaccccaaGCCttc^^^ 

aaaggat:Gr-r3tctctctaaQaagt:taaaaacc=rgeccgcagacaaaacccct:c=agt:tt.ggcagtaaac..tc^ 
.33^;„a;^.aLaggcaggnoawacattgttcacaactgc5aggaartaccatgcgccacccactgtgcga9gpc^^ 

a^aatacaaccacggacaatttgagccatCTigcccaggsttccaCagctaccccagcrtcccatgactiggaga^ctssss 

Ictacl!cl|-;l?llcStlttctt;a^ 
-ttqctacl^i"ttSLatctctcacctti=^ 

ittLca;;=-=tcagtrrgtgcttit:aacagtttcccaggcaagttctccci:aggttcggctt:ctagt:gtgttaacctt 

"ttiatalaccgaacccaaagaoacaaagtiagaaacaaaacacctcacctg-.titttgcccatgaatitiactctctatggaa 

ggaacaa"!'IaaLc«ctscg^ancac3gac«ccracct^ 

agaact=CcaacgT:gcgagtcctggcaccctigc-.5aagaaccc5tcccagaagagat:caatcagctggacaagct^ 
^^-I-.^aLcIggtctticctagaaatgcacattgasgattacgcttsgatatr^ 

crcraaac-^Jagaoatitcccacat^ 

caacAa--aaatatat:oaat:cagracatat:aaaeattcgactaaaact:cgacgi:aaaaataagt;gt:tct:acatt:caac.^ 

t^tc^cc-ttctto^aoccctcattaactwgtitcctcaaasccatt^ 
,^;^^,^:;:^;^^^:gi,Liacttgtattggacaca.ccccccagcccacat:caga..«^ 

ctaqtt^taaqtacaacanccccacctcacttaaaaccactaatcaaagaacgagtccgaaaatgaacaaaatgcaa-c, 
tac^; :::aa:a:'cgcag^.gtctt:.agrt.zggt:caggagtcgstttc.tg.«g^ 

M-ttaa--eaettatttatt::ccaatagaaat:^=cagsertacatccgaactcagaaacaaagtnt:tctttCwCatta^a 

CTCTGGCCCAaGAGqtaaoctntntctctccascacoasgaagcggaccacccactgtaccageattetcattoccsagc 

«™caag^°«?"tia«t«aatatcacacgggats"anat=aa£t^e 

;:ftt"l^irc=ta;«taiaa;cgaagaaacccaaaactca.agacg««agagc.«^^^ 

tQsatScccaatetagaaccccctctatettcicetECCgcocccaggEaaccacaiatttgtaaagaMagaatM 

c„„a=-c=cca«lac=oc=ri:tgt:cegac=agc 

"!nacalac«e«g«a;«aaclgc,aacaacatsg«=c=ccaauteasc=c 
gwot-cn^fwalaacawatct^caa^saacatcaagtegasc^gccettt 

ltn;=-aatcc=«t==ca=«<:cagagg===tst:agccac?<:gcaaaascast:cagctctgacccoeascca3t:= 

arr.jactaaacasttogaaaaiacaaaaaagctsagatcasacacta^tgatacgt^agtgcatcccv.cc 

!::rara;§:ra:ragrc:»ra:"gcacz4t»gc==ai.cag=scscggaaagc.gataattttg^ 

aiccraaaliaaiaalaaaaagggascgaa^sccttctgacaagctgeccaggcccagacagteticacccaw^^ 

°!^^:!„r^::Lcaaace-=;=tac««=atcatg!:ccaL5CtgcasaEgscaecaaggcteasaaaggtcagc 

"^!;:"!*:!"aat«-;«U--iacaaaacn^ 

r«;r=ri;:rara:c!===;:ra;4a«5Saa.-,c;aagsagaa=c««aagaaagaaaa.a=gca=cag=agaa=e 
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SSaggacacGcaggcaccacctcagatgcccacccagcaccgaccacctgcctsccactgcraggcgttgaggacrcagc 

aatgaacacgc^aacttctctgcttccttggggcacgtacagggtgacagacagaaacaaacaggccagcgcacaacgcc 

acaggaecca^iatatgcaccaaagaaaaagcaggcraagggscacagagcacgagaaggiigctrtcticctaaagggckrga 

icaggaaacccctctctaaggcgacagctggacctaaaggagacgatagcatgtctgtggtgagggaaggaaacvccgaa 

;aggaagaazggcagai:acaaagacat:zgacgctaga3catgcccaagcaatgcgc::taag9accagggaaagcgagcaa 

scggtscccccacgagaggagctcagagcaggaccaggtgagtgccatacacgcctcgcaasactttggattcccgccgg 

ccgagaigacaacccaccggagggcttgaggcaccggacatgatgtcatctagagtctagactgcncacactctggttgt 

zsggttgagaacagaccgggatgggggaaaaggacQacaaaggacaiitgtgctggac&gagaaagcagtaagtcagcttc 

atcc^ctcactcaacccatgatgttcaaataccacaatcatccgtgggctaaagcatgaagacqcatccctccctgagag 

tcaggaagcacttcccasataaacttcggagtgt^agctgaggtgtaggacaaagagtaagagtttacccctgaaacagg 

cgctgccaacagtcaazagttcggaataactcaa-aatctatggtacc^ccttagaaagatttgctggctctatgtggga 

agaaattwlctrttttitcattggggagtggtgggtiSGtiQgcgaggcrgcctgtggaaagagaagtgagtgttttgactca 

ctgttactcsaaaarcrctagggctgtcccaacaagcaacaaaacccaaaatggcctggttctctgtcccctttctgtct 

Statgcczcgcacaggt-atgaaaagaaaaagtrcagaaaagctgtccacctcacctaattgcgttcttgtggagtgccc 

^agaccccccctctctgcagaaaaaaaatccctc^rggcccctgacccacctctggagagcctagttcccttctcgaggca 

Saaggcaaaccttaggacetagagagtgctiggaccacgccactcacaggaaccagcaggctgtgaggttgaaagctragsc 

atacgcagcrtwccagcctsggt.gcacggcctcg*ggccctt,ccccccccct.ctgtg<:tctat.agctcagtcttcccagg 

cggt3^=aacacgca«;:sacatctccaggaatacagscatt:T:at:taargaj:t:ccctgT:gaaat:gt:ttggaaatacaaagt 

accctacaaacacttcatiaatagcatirggggctgacaactccacaaagtgccggaatacattrgcatgtaagacagaacg 

ctgcctcggccattgaEgcctgticgagvggcagticacagacacttccccagaccttccgactcacgctgttgggactigtcc 

tatgcacgccaccctc-tgtgtggcataggatttHCgccccaccacacactigCtstagciittgctgtcttgatgatgact 

ag^Sggcae-cctccagaccacggcataagcatczactgccccccagggttaccaaaaccaagecaagtitgtgtcccagcg 

agctccctcaagcatcaagaagttcagtactcacagacacgacgtgacttuccaaaqgctgtiaagcngacgagggacawa 

gctagccctcagacttgagtttttctttttcttitttctitcccctttttttitctiaagactgagtcttgctctcgtcgccca 

ggctaca!:tscagtgg;:gctnggct:cactgcaacctccgcctcccggct;tcaagcaattctcctgcctcagcctccccag 

tagctggcactacaggcacctgccaccatgcccggccaacacccrtgtatctttttagtagagatggggteecaccatgt 

tggccagsccggtcttgaactcctgacctcagczgatccacccgcctcgacctcccaaagtactgggattacaggtgtga 

gccaccgcacccggcccagactcgagtctttcasctwaatgcttttccatcgcctgacactttactgagaccaagatagg 

gaactzcacatacagt:acctt:t:tctcccaaggcggaagagcgct:gtt:caattt:ccacacc.agagCtcgssgagt:cttaga 

aatgagccagctatcgaggac.gaaagcagCCcczcataggctcaaccacaacgagatgtagetgttcagagaaagcaccc 

ttttacctataaactcgaagataaccccggtgaaacgaagcccagccccacgggcttcactaactccaggctgtgcttct 

caaactCTiagtgagcacasgaaucacctgggcs-cttgtgaagctgcsgaCwtgaaCtctgcaggrcggcagagoggtct 

cagaa^iccgcatttccaacaatgtccccagtaa-gctgatgctgcrcgcccctggaccacagaccgggtagccaggtitict 

ggcaacctcatcccaaggctccgagatgacatcagacaaaatatgttctgggacatggcctttgagaggccaagaaaata 

acatsti'icr-stctctcctcatccccaaccctzgcactigcccctttctccctitcccctiacccticccttctigtccccaccc 

CtgacccCacCrGTTCAGCATGAtSAAGCTGGASTGACATSCSACAGGACCTGATGrrTCTGACCAATGTGAAC^ 

GCTCC7CCACCCAAATCTACCAGGCTGTGTCTCSTXTTGTCTGCSGGCATCCCGAGCGAGGGGGGCTGAAGATCAAGTCT 

CTCAACTGGTATGAGGACAACAACrACAAAGCCCrCTTTGGAGGCAATGGCACTGAGGAAGATGCTGAAACCTTCTATGA 

CAACTCTACAAgtgagtigcccatgcagaccccasccctgtccccaaccccaticcctcccttagttctsgccttggcccgt 

gtcatcccctccctctgtaccagcgttagatg-ctacatccccattcgcecaccagactgagctcttcctagaggagaga 

ggctictcttgaatagctacc-stccccagtzrrcrgaatgcagcccgccacatctcaggtgcacag'cagtgtt.catcaa 

cggaatcaargattcacagccaaccttccggtttzctgcgcgatgtcgaaccgtggctcccagggtgaticaagaatgaga 

Caatggcagaaggacaaarcccgcaaganct-a^" -atacat:gGaatatacgtaacgt;agaaagt:gt.cagcttcacatga 

tgaacaagtccctgggaccc wgacgcacatcrz5acgactatacctagcaacactgtat:ast:at:actt.gaaafttgct:aa 

eagaguaGai:cc:gaagtgt:t.caqactiacacaaaaaagecaact;atgaggtgatsgat:ttattaacagcttgactgt:ggtg 

at:cctt;tT:acaaagtatacat:acat:taaaacsrcacacrgt:atacctcaaacac.acacaatt:ttt:a::tt:gtcagttccaa 

cLcaaaaaagct:acaaaagca-Ctt:s.aaaaaggat9aC9-£qcggt~t;r:ast:actaccact:ga9ataagcct;tat:aataa 

cataaaaagaaataacacca£cgacaacagc5acaacaacsac3acaaagaactaa=a::-::«agt:3ga£.cztcccgt:gcs 

ctgcccattctgtttaagttracctcatzctarsctcaccsCaacctgcacggaagaCTictctaaccccacatttcaragg 

ct"acagaggctaagcgcct: tgcrtagagccacaccacagtstaatccacaagagccaggattcaagcccaaatctccccg 

catczgtgctctctaagacaactgccagtgc-gccgtgtgccttctcacactcagacatttgatctgccctttgt" -ccc 

artcctagct;gcaaggcagngt:t:aaagaaci:=t:gt:gccrccataticcactccccaeact:taaecaci:tttgtgggcccgt: 

ccgccgnatgcctcgccgcagcaccgatccaa-gtcacacttittaggcaccggcatccttttccttgaaaacctgatgca 

ccggaacctttccccatt-ccaaccacaggzgrgcctttcagacactgaccgaggcaggttctgtactttattgtaacac 

aacaaccttct.ciicctcC3cagt:aaagcacz = cagacacwcscaa3tcgcct:t.acaagcct:raaaaggatggtaCicgtiac 
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SEQ ID NO: 2] 

jcncmic ccnzig ccncaining ABCZ excr.a 9 throuc'r. 22 1 

aczctar-accaaggatsgtctcgatctcctgaccscGcaatccccctstatcggcctcccaaactgctggaatKacacg 

ratgaaccactgcgcccrgttgagaat-tcttcctctzrctctgccagaaacactutcgctctrgttgcccsgcctagag 

tgcagtgacacaatctcgectcactgcaacctctcccccccggetccaagcaatcctcctgccrcagcctcacscgtcac 

cacgcccagczaattttgtatttttagtagagacaagcrtcctccatgtcggccaggccggtctcgaactcccaacctca 

rgtggcccGcccgccttggcctcccaaae^gctccgattgcaggcacgagccaccgcgcccaeccccaaatttnggtttt 

tgct:tgaaaac:;gaggt:ctgaattcagccttctgcttsccct:tcaaeagtcagtct;aaatgtt=gtcatgccact:tgtca 

ctgaaaacaatggtgaggctggcatgagagcgrcaatccgga&gceaccgct&ccgcctcatcaaaacactC'CwCcaga:: 

cagctcactcgcgagttacccgtcactcacgctiataatiaagctcteattattcatcaagcatcattccttacagatatct 

zastttaatccgagataatcttctccacatccccccacatagatcttatgaatsttacttctacagaggagccaactgag 

5c:;caaataagtiiacttantatatgac*:agtiagt:ggtacagctgncgt:ttcaaccaagaaetct:ctggctccaaagccct 

rctaagcttccatcagtatatgaccacccatatcagcatttgtc-CtcctcttcttcatacCTCCTTACTSCAATGATTr 

SATGAAGAArTTGGAGTCTACSYCCTCTrrCCCGCATTATCTCSGAAAGCTCTGAAGCCGCTGCTCCSTTGGaAXGATCCT 

ATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGgtaagctgcccccageccaagactccctccccagaatct 

ccccacaactgcgggcaaaaaactcaacgcagc-rcacaggtgtgcgctaactatacccaccgctectctggaattccca 

gagtgaaaacctcaactctgacgcagaccagagc^gcgccagctcrccactcgcggctatagaatcatagttacaagcag 

scatttc-tiggagatccggaggactsccacagggctgc-ccgatigggctatctctrccagggacgagccaaacgcrcattg 

cctctgct:ccccctcc«t:tttcsgcgccccccggct:ccccacctgaccccacCTGAACAAGACCTTCCAGGAACTascTG 

TGTTCCATGATCTGGAAGGCATGTGGGAGGAACrCAGCCCCAAGArCTGGACCTTCATGGAGAACAGCCAAGAAATGGAC 

CTTGTCCGGctgagtgcccctcccatcattaccasgcgcccgct-cataccccagagetgactittcrggtcacCCCccca 

ggtgtg3ccgaggtgagaattccttC3cttt:at:r*::acct5ggsc5atstagtgagcatagccaaagtcacaggscaccac 

ct:ct:ccacaaetiaca9gccat:ggt:gca9agat:aacgcwgt:gcataccaacat:ccat:9ccactcacggt:caaatagcac^t. 

ttctgcaaaacttagt:gagggccgg"cgttt.ggaagtggacrtgagtaac-gcagtaccctac£tt:cctct;t.tgccgcacc 

cc.ct:cacccagccacagcat.ct,ccct:Swgtct;tggcacgt:tttcGaaacaaccgi:gsgagcaaaagcacgatgtt:acarg 

cagact:9cccngagatactcaecctcagggcacrgtccgaatcargagct:gct:gt:t:actgt:gt5saggggaaacgcacrt 

agtgcttcacagccacttgaaagggacaagtgctctagagacaatitgggttcaaatgtggagcaggccgagcaagaacag 

aacgccccccrtgcctgagcctgagtgccgttiaatcacatcttcc^gccctgagctgagtcagagaatcatitagacca:::: 

tcctcttticcatggcgagggacgcctcttceCTiticgcctctsczccccrcaagaaccaggcgaggatctCgccacfgcssc 

ttgttttcaacct^attigactttaaccgcgaccgg^t^ccagacaccgtacctacctaggssgaacacttccgaagttira 

ggactaczccctgaticcgctgggagecaggttactgagcaag^icccttcaaaaacaaaggagtccatiactgagaaaagca 

t:aaacag"ccar.ttgcatggat.t.cacactsaciraatatagc£cacsccatt:aaagcggggccccccctctaaagcagce::c 

Btatgatc^agccccgtagacctiaactgtggcrticagacccct-cttcccgsccctctcctcggaatccacattitctact 

agttggaccccctctgtttgcctggctcticasacgatcaraggacgccccgcsaagtigacccagtgaattttgatttgtg 

ggcaag^agatcgtcccctagtctrgaaattgaccccgccttacgcgccccaatccttcataagctcccagtitcctaaagg 

acaagatscttgtaaacatggcaatcgcattcactaggaacc-agcuogcaaaatccagtgtgtatgcctggaaatgagg 

gat:ccccgcct:ggagagaaaGccat:gs9cat=ccttsgagggacccccgc=tcaagct5acgac=rT:zactttaagc'^ 

^sgggaccaggcaaggggagacgcagacacg; wactctgatgcggcceacgaactcaacaagga tcacgcaagaatcaag 

gcacagaccagggaggaggctccccaaccggccaagccataaagcaagaaacgaggcctggtigactgcctagfeggcagag 

caccgaaagagagegaggcaccaaaccgagtczcgarttc-acccggctggctgctagcgatgtccagcaggccagtg 

tact:9acgt.ctscagcggaggacggtggctggccccgacacacatgacg3gcaagtcatcagcct:grccctgga 

gggaacctcttccaactgctct:ccttcct:tcc^ccc^ctctr^ctctttttt;:rttttttt.ggacagact:cteg^ 

cacccaggctgaaecgcastGCcatgatctrgccccaccacaccctccgccticctcggtccaagcaaiirctcctgtccca 

gcc-ccagagtagctgggatcacacgcacacazcacsgrgcccscctaattrcrgtatcttcagcagagatggga 

ccatgcr9gt;cgggctggaat,gaact.ccC9a=ct:caagt:gatccacct:gccrcagccCcccaaagtgt:Cggaatt;acag 

catgagccaccgcgcccggcctittcttcccr cz.ct:r5aagaczgt:t:t:atctaacciccacaaacaticagcttgt;caccccc 

tgraccccggcatctcct:acacg5ggtgatggct:gasccccr::gctwccgccgcggtagctccsac""ri:ctgct 

tgccsct rt;cta=ccatc-cg=ct:cacccca-3ccccccagg=cacc"ctcccggccaagt:ct: wCcaaccc 

gac-^ sere cc::w::.-czgag ccgc^tz tags caccca~crzc=9gacc~gT: tit - -"crctrgcctccacccctgc 

CCucaggtctccticcccctcaccaccaccccagagacGccscr-g^^cagrcacctcactgcgcgcancrttctag 

cczatircttitctggccaLgt -Cwtcagaaaccatacicggcsgcgccgactrcaccctaaaggctgcgcctctccacticr 

gc^rrcgtttgtitccaaataaagtceciitcagaattgccaaccccagccrctigtgaacccgtgagguacaattttG^gcc 

t:gct:a5:gctaacaaaaacacacacacacct:::c!:t.ggc5atsctacaaat:tcctaccc'cctatt;gcaaagcaat:titggaat 

caaaactcaaagaaccatitr-aaaacacgc-aTicc-gcgtacctccatcccacccacccccagsgargcagcctaccgaa 

aT:aat::ttaaagaagtcacc3"atgagaga5aat:g--act:5ct:at:a*tcztattgi:gagaaat:tgGaaatagact:aaatg 

tt:cagc3ccata9gaat:aatiraa5aaaat:ca=acazac-rta::aca«i.cart;a::gctigccatt:gaaaT:aai:aaat:ac:aaa 

gQcgcaagggggsaaaagcc::araat:grt:agtgaaacT:aagaCwga::t;:;^ttt:acaaagcagcagc^ztcagacc=-ngg 
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sgactccaattcgg^asaaccagagcttcacctwc^ctgtcgaaaccscgacacgacttgcaaacgcctctcctcrittgc 

raagtttiGcagctgctgtcttcccgccagcacacctigCgcaggcctccgcctcggcccccctccatctgctcaccgagca 

gcggattgacctgtccttccctttcgtgccgacccatgtgaggaaccaactggcaaccgaacaagaaatggaaacaggcc 

rccntnccaccatgacctgtacatcccgcaattggaaaagattctactstacttggcttaaccagcagcattattttrct 

aaactaaccastaagaaggaattaggccctatatggsatcaacagactgggtctcaaaagagcaaggcgatagaacacag 

*3S99acgcccagstgcact:agaaacaga5g3cct:atgcttccattctggctticcc=.acttaacagctgtgtgacccaat 

crcagacaccSaacctc::ctgaaci;ccca^tttctcatgtataaaatgggaaaC3tcaaaggacact:cactgggctg3t;g 

cctCgtgcctccaaccccagcacttggccaggtiTicaactgccaggaccacttgagcccaggtcrtcaagaccagcccagc 

caacatcgcaagactccgtctctatgaaaaaattaaaaactagccagctgtggcgctgcgcacctgtagtcttagcTiact 

iggtaggcticagatgggaggatcacctgcgcccccgaggccaaggccgcggcaagctgtgaczccatcaccgcactccac 

cccggccggcagagcgagacactgaacccaaaccacaacaacaacaaaaggcaaaaaaacaaaagcgccctc&ccatgga 

SCtgtgtaacctgaagcatatac3cta!:::caacatagt.aactatataaaggaagtaccgt:tgrtgttactgtagt:ttaata 

csattaactcagatgtttcgtatagtgcaaagcacatggactctgaattcagaccggtctgactttgagtctcagctcca 

catctaccaacaccatcaccaagccccsgccaaaa wcatgtttccztttcttcagcctcagtcttictcacatacaaaata 

«ggacacrctcatttacctcagtttt:ctgtgag5acaaaacaaccacagt5t:at:atgcaagt:3ttttgtaaatt:t:t:gtas 

tgcCccrcaacattttagwtggtgtttactacttcracrtcctcactggaatcGcacATGCTCTTGGACAGCAGGGACAAT 

GACCACTTTTGGGAACACCAGTTGGATGGCTTAGATTGGACACCCCAAG;iCaTCGTGCCGTTrTTGGCC^ 

GGATGTC CAG TCCAGTAATGGTTCTGTGTACACCTGGAGAGAAGCTTTCAACGAGACTAACCAGGCAAT CCGG A CCAT AT 

CTCGcrrrcATGGAGgtgaatctgttgctgggatcacctagaaaacactitaacggctrctttc^ctgagacgtcacaataa 
ggtt.cac9cacaaggcaagi:t:*:a9aaacaatgtat:agt.ct:cattracaaaaci:ai:ccctcaagcct:aacacaggat:t:t:9a 
caacaaaagacacttaacaaacgttagctigagfcGgctcaatgagtaaataaactctagctctagtiaaattaaccccagct 
Tiattctarataggcccaagagaacatcticcacccattrtcttccaggtcttcacatctcagcgactaatggtagcaaagc 
attcccttiaaaaaggcactatttgtgaaacctaystaaaatcgaattcggcticcaactaaat-tttgaaacttcatacta 
aaaattacaccagtagcgatgggtaacaegtgt^i^tggiiccggttggttgc^iuagtcgctatgactcagaattgctiaaga 
aaacacaaaagtaagataaaaccattcttttaasncctttccctccacaaaaccaataaataacatatccctaaattac:: 
cctagaanttctcttaaactgcagtgaaaaaccaaaatcctccacccttggtcgaacgctggaaaaccacgttagagacg 
attagagagagaggaCgascaatc9tatagtcagcccttt:gcctcc^agtgtaggatt:tgtc^cagccactgci:T:gttgTC 
ctggcr cccaacgttcccatgaaggccgccet = era tCacTCTGTCAACCTGAACAAGCTAGAACCCATAGCAACAGAAG 
TCTGGCTCATCAACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGAATTACTCCM 
RGCAGCATTGAGCTGCCCCATCATGTCAAGTACAAGATCCGAATGGACATTGACAATGTGGAGAGGACAAATAAAATCAA 
GGATGGcGaagtggaaccccatcacaccagccrgetcttggggaggtccacagcacccattatatcaggacaagaggtac 
tttatcztaactaaaaatttgeiiagaaacctcaacaatiaacaaaaaaactcaactcgCTtgtcacgaccttggcgaaatLsg 
gtacatgacttgctegaaggrctttcataggtcacaaaacaacagcatctttcgatttagcatctctactcaagggaat:^ 
aattccaggaacti:tggtggcaggcacctgt:aa-cccagctaccccgcagcccgasgcagcagaatt.gcttgaacccagg 
acgcaGagst,tgcagt:gagcr.aagaccgc;at:car.cgcac:ccccscctgggcaataagagtgaaact:ccatctcaaaaaaa 
aaaaaagatacaaaaat:agaaaaaggcgctt:gc-aacggcagtagggttttgggcaattttcttitttttrtcti:t:tt-tt 
actgt:a::cgttct:aaaggaatggttsa?i»accr ccggtctggcttcacGTACTGGGACCCTGGTCCTCGAGCrGACCCC^ 
TTGAGGACATGCGGTACGTCTGGGGMGCTTCGCCTAtrrricAGGATGTGGTGGAGCAGGCAATCAT 
GGCACCGAGAAGAAAACtrGGTGTCTATATGCAACAGATGCCCTATCCCTG-rr ACGTTG ATG ACATg t aac 1 1 a C Ctg ca a 
gccacrsCCtttaaccagcttatactgtgccsgatgcgggtgcatataGgtgtgtgcatgtgcatgcatgtgtgaatgat 
ctggaaataagatgccagat:ccaagT:tgtcaa = actit:gcagccacatgacagacatagacacat3cgcacacactactaa 
acctct:t:tccttct:cacccaT:ccttgccactt::-at:ctc::tcatttttatttt:tct:t:tx:t9agat;cgagtctcgctc;t:ga 
cgcccaggctggagtgcact.ggcT:c5acctcg=cccactgcaaccct;tgcc!:cccggcGt:caagctattctccrgccrca 
gcctccacagtagctgsgactacaggctcatcsrgccacgcccggctgac-ccttSwatCT: wagtagagacgaggtcrca 
ecat:cct,acccaggct:agacttcaacc,ccLsagctca5gcaarccacc3T;!:cctggcct:cccaaagtgct:gggattaca9 
gtgtgacceactgcacccaecccaccactC'::aa:itczcT:acaccctaccc'::cttgcccaaaatctgctcaatcrgcaagc 
tcaaaacgtgtcatgacaaacacacgcaagca^atactcacacatacacgcagaaacagcgtccaaacttataaaagcac 
agt:tt:srctaaat9t:Gtccact.t:ct;tctccc::a9c::ggcaa£Ccacat:i!:casaacaacccaaat:aaaactgaacaaaGC 
ttcttcctcttacacttCticcacaaaatctttcagcgccgagTicactaagccgccaagttctcaiirgngggaactatgcct 
tts9a-gtaatgatttcctcraa5acaat:gggiggagcsgtacrtact:gca9acacct:gaaatatgt:aatgt:ctc::::cca 
9attctcgaaattccct:cai:-ctctgcggttcgtggtggi:=g::gggat:sc3tgtccgt;gt:gtgt:gtgtgtgt^ 
gcgtgcagggatcaggatccccgagcagccgggrcctccrcgtattggcr crctgttutgcactgaacagtgt^ 
t:gta^Sgct:atctatagct:r-ccaaggt:cacragaaatT:ai:cctgtt:t:ctcaccti:ccaaacaattacctggaatzcrtc 
aaacgaagacttT;tacaaacacccctaagc;:saggt;ttaci;ctagaaaggatgtct:caagacagggcacaggagc=caga 
ccca^t:aagagccggtccc^g^tgtcatg^ar^gagtacgcccctacatggtlaaagc1:llcgaegtgaacctc 
■ ggT:ccaaaacGtctgt;ccccztctactttgcacatccgcacc:;tct;actctagct::ggaatcZ5aaacacc 
ngccrgaaacgtatctccgwggtgtigactacagttacgataagcaacccaatagtgagatgaccticggagatgttsaaci: 
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-rtgccacacaatgagttigctcttttcttttggtrtittactactrtaacataacctacctctagtttaaatacccctcac 

agctacczacctacrgaagcaagcccccaaagaaa^ttgetittsccaacactccgrtagcctcgtctcccnctctacaci; 

acattgcTicitgaagcactggatcatacgiiaca^rtcagagtctacaccgcccgccctwCCgcggcccagatgrcgtact 

cccccraccatgcaggcccagaggccctggcccaticaccccggcccacgtgtgtczttcttitctccccttgtcijrtccrt: 

5ggS C C t C c acCTTTCTCCGGGTCArOACCCGGTCAATGCCCCTCTTCATGACGCTGaCCTeGATTTACTCAOrGCSCTGt 

iATCATCAAGGGCATCGTGTATGAGAAGGAGGCACGGCTGAAAGAGACCATGCGGATCATGGGCCTGGACAACAGCATCC 

rCTGGTTTAGCTGGTTCATTACTACCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTAGTGGTCATCCTSAACgtaagg 

ragcctcaictcgctictticcccgccaggaaactqcgaaatagctcaacacgsgctaacggagcagaagaagaaaaaaaatc 

caagcctcccczagacaagg9stcat:acct:gt:catt:t:cct.gcaartt:ca(:ccatt;:atagttagggaaagtgasgcccas 

agsiSgsscactgacttccccaaggccaacccacccgggtagcacctaagtacGatgagagtgcagggtccatsctttcca 

gataaccacacgctcaactgtgccatgctgtctcattggtagtggcccatggcagcatctgaaagctatttattctctta 

gatat3t:tgcgtggceat;tcttcct.aactT:t:ct:aagaacaataaccaGaaccatatatattgttgcaggtt3gactgtct 

cgaagcacacccirgaaacagagttcgatgtatccciiattLatgacgccTicaatacctatggaagagatatggaagatgca 

ccactccccacagggacgagttgaacrgtgatatacgsccaaccccecgcgccacECtagagaatatgcagcttgttcga 

cttgT:t:ct,tcaccgacc-gaaacatccagccc*tt;gt:gct:cccc=aacgcct.c;:ctcctgacaccacci:acctcagccct 

crcaatcaatcactggatgtgggctccccccccaagctcccgccccaggccctacatggctctctgctgctgtgacaaac 

ccagag*£cct:gatgcctgagcccgtc::actcacagct:gggcaacaaagcttccccgaatgcggactctgsgcagtgcag 

ccttccgtcrcaaccatacattaatatatttacacccgaacrtcctttctcrccaagcaccccataraaagacacatcag 

ctaaaaataaatgcttirtgaagcaaaaggagtacaaagacataacaactaactaattitaacactagttaccacccgiicac 

aaacactccctactgattgccaagcactgtttaaacacaccacaiccgcttccccctctatcctcactaacccctttaac 

agacaaggaaatgagsct:ca3gaag9t:caaggact:ctat:tgagctt.ccacact5cganacagctct:cgctaaaagcaacc 

cctcccrcatgctctgtt:aii<:T:aactgcaaceccaac9t:cagcggcagacctagtggt:cccatiggttggt:gcat:aagagc 

tgctcccacacaactgcatgctggcgggtcctigcaGecawgtacccaccacccggagataggctcaaaaiiatccacaaga 

gtttgcatgatitgtigggaatgcagaatccaccctgatcaagagcgaaagtcaagttgcciiggccattttccztggctttt 

agacagaaaagttacgt:gggatat:carctcccac:agctG:: = ctgt:gst:gccaccaatcatagt.ccttatataaggagaaa 

ccagtt5aaact:acctattgaaaaaacaaagaccaaacr.cgcc=actgaaazgc9tagaaagccctggactccgttgt;at 

tcataactcwcccatcatttttctrccgcacttrzgcccaagtcacticatctwct^itaggatggtaatgatcagtttgcctc 

accagaaagargaacagcat:tacgcct:ctgca--gt:ctccaacaC9agtacgaataaaccct:gtcttttt:t:ccgCagat:c 

atacaagcgagcgct:tgggattgttcaggcagcacattt.5aC9rgtct.cttccctcccacTTAGGAAACCTGCTGCCCTA 

CAGTGATCCCAGCGTGGTQTTTGTCirCCTGTCCGTGrrTGCTGTGGTGACAATCCTGCAGTGCTTCCraATTAGCACAC 

TCTTCTCCAGAGCCAACCTaGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCCTACSTCCTGTGTGTG 

GCATGGCAGGACrACGTGfGGCTTCA CACTCAAG ATCTTCGCXc - gagtacccctcgcctrtcttcag tggcr a t a gc c a t 

CcgaccttccrttggagccccvgaacaaaaccaccaagttgacaacaGaacatgactgtcttttccaatgggacatgaac 

cttacc!:ctagattct:aagctctttaa3ggtaaggccaagcaz-gtct:tt:i:at:caaatt;gt:tt;acctttagtcttct:cac 

tgaatcctggttgaartgaatcgaatggaac^^ritccgagagccagactgcatcttigaaccgggctggggataaatggca 

c^gacgaatggcttcaggcaacagacgccat:=t.ccccc::i;tt;Ercccccagctctgttggct:angtt:aagctcatgacaa 

accaaggccacaaatagaaccgaaaactctzgacgtcagagacgaccrrccrccgcctcccrtgcgtrccagtatggtscct; 

tcctsgagtaatgttttctgaactaagcacaas-gaggaacacgtgcrrticarcccacaaactcetgacttggacacticcc 

nccccrcgtacagagcaggoggatatcttgcacagcgtsccaccccccacaagcgcaagctgtcagatgcccccaggcca 

cttatLcaggaaagctaagagrcactcatagca-gccccrgtsgcctcagcctgggcttcataggcatcagcagccccaaa 

cagccaccrctgatcctgagccatccttgcc^gagcaggcagcctcagaacactgtigggtatgcgcatgtgtgtgscgga 

acacgact;cctgagcciitiggggcat:ctttggaaacata6ac^tctaaaag"=t,tcacgcttc:actgtat:atgcatt:cct:ga 

aatcztrgtacataat.gac'cggctacaaacggaawcatzcTiazatcrtacttggt.agcccaccactccctaaagggactc 

t:acacgtaaatactacccctgcacattatg3.xz9accc5Cw::::gcaaatcc:aaat:ttctccaggt:ai:aatct:acac::aga 

agagatagaaaaacgagacccaccaggaaargaataggtica^rictsccTZcrccctcacagAGCCTGCTGTCrrccTGTSG 

TTTT GGGTTTGGCTGTGAG TACTrTGCCCTTTTTGA GGA GCA G5GCATTGGAGTG CAGT G G SACAACCTGTTT GA5AGTC 

CTGTGGAGGAAGA^TGGCTTCAATCTCACCACTTCGGTCTCCXTGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACC 

TGSTACATTGAGGC2*GTCrrTCCAG5cacsccgcuzr=gc=£ZCC9t:r-9ca3aatatGacttctagctgatgtc=ttt 

tutctigctagaatctctigcaccgcatgggcr-scctigcgasczggtTitgggctawagatctatagcaaacagatagticca 

acgacaggcagccgatgcccaaagracaat-gtcactacti-gwacagcacrtgttccttgaaaactgtgtgccac 

ac9caaaatgttttatacac3zcgct:t:c?i:;--aact:c-cacaaggc::act:c;tgaagtagttaccat;aaeaaccagcaatt: 

tccasatgagagaactgugacccaaagacgr^aagtaaccagctttsgtcacacaacrgttaaatigttsgtacgtgeagc 

cgaatccacttcggttacacsgcgtcaataagcccaggcgaa-cctcccaacgcccacccaactctgtattcctc^gtcc 

ccagagggggtacaacnagcagaggttctg* -tcccgagtacaggtccwtaataattaaacanacragetccaacgcccg 

cctg-iatttaattagcacScaacaaaaartcargtiEgaacrtttctrtagtacttcticccttaacataatacatcttcii 

tgaccaagtccaagacgaacctigcgttigcacagc^tricaca-gagaticaaattctgagagagcaagatctiaacccctitt:- 

cgt:rcacctcccgaccct:cccct.aaggacc-3tacatgaaaiattT:actact:cctgcctgaactt:c^t:t:Gat:tgaacat;g 
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caatttrgcaccatgcaaaCtctgaanttiaaattctgacccctaacctactcQctsaSSgaccctggacaggcnccttat: 
-cctcac^ttcctcatct ctaaaatggsS-st^SS'^^cct^ccccgtgsgttgttccaagsacttiacagaggtgcagaatc^ 
acgttccacatagcaccccccagcaaatattagccccctictttccccacacccattcaaatctgttccttctccaaagca 
igtgtcaaecaggaaatagacctsgctscgaaaccctcagaacactgggacgatgctcagcttggctcacacctgtgcst 
rcc 1 1 C C aeCCCAGTACSCfW^TTCCCAGGCCCTSCTATtTTCCTTCCACCAAGTCCTACTGGTTTGQCGAGG JU^ 
SAGAAGAGCCACCCTGGTTCCAACCAGAAGAGAATGTCAGAAActaagtgctgttgaCCtCCtgctCtttCCttaaccta 
^-g^-ccz gc~tt;::gi:*:3act:7t!:acccccaacc5ar3"crcct.5ccctt:ct:aaaagactgtg 

sagaaaccacatgcagcgtccctt tccaaatccccccatgccaciracgcccaaSSCwSttgacccattgggagttcaca 

^cctccasccccgagcgacattttctttGtCgtcrtgccccctacaacagcatctsctacttcccccctcccaaacacac 

atttcatLcgtctcctaacaagctagaacaaasacgcaaagacaaccgtgaccctggaaccatagcctcgctgcccgcctig 

zgacatgGtgacctgtstatcagcctgtctgggctgagaccaaccssctaccacagagctcagcctatgcttcataatgt 

aaccatiacccagaccccnaatcctctcctggctictcaactgcacacagaga^rstccacagctcatcaaaggctctgcwt 

ctgggttctturgcgcttagagtggcttcccaaatatttaacagcccccctctcrgccagtccctitctgtgcccatcccct 

cattgccctitggtaaaagtatgatgccccttagtgtaccacgctcccccgccgcccctaaccatcttctcctacctcccc 

ctcacacccagcccctccctcactcaecragaaatgc^cacagtcactggaatatigtcatgtccttccacacctccatigc 

ccttg^aggt3Ct:gctcgctctcacaccagaacts.tczctctaact:tccccatctzctcaacT:cctcctttctccccaag 

atctagttccggatcccctcccccgagcatccctcGtcggttcCwaGgcagccactcactctctgccctgaactCccarg 

gcacgsgaaagaaaac^rttttattccaaaacaatcacagactiacaagaagtaatacaaattacatgagggggctccrct: 

LaaaccsttcarccaccttccccaacGgtagcaccacgtgtaac^gcagaawagratcaaaaccatgaaattcacatagg 

tacaattcacaaacccccttcagatcrcactagctttatccgcgcccac&tgcgtstgtccgtgcgtatttagctcta^g 

caattctaccatgt;gtgaacccatataat:t:actaactcagccaaac*ccacaaatacctcatt:gt.cacaaagct.ccCwCa 

cgctaccccttaatcgccacagccaccrccccie^iCcctcactJicctGacacctgtcaaccactaacgcgtccctcgttt 

ttacacrtttattatt:;:ctiagaatgt:tacat:aaat:ggaaccacacagcaccraccct.t:tt:3atactggctrt:tttit:t:t.t:c 

ctccacccagcagtatccccttagatctatccaagttgtgtgrgccaacaccccatccctctticactgctgagtagtcTit; 

ccccsgcaggggtgcaccacacctccaLggcatitcttasatgtatCwttsiaaacagccttcagcatcctctatitccaacc 

gttcarcaagtcctttttcccaacacacccccaaccctccttcatcatcccattcccatcaecaacatcagtacccaaat: 

aggcccraaataaacatttacagcctcccgcctgcc-aagaaaccaaggtccacatggagagaaggcacttc'cgaaagtt: 

caagcgcagcgcBctctgtcctcacactGcactcct cagtgct^tcrgtgggttcacctccgcctcctctcctgCcacag 

TCrGCATGGAGGAGGAACCCACCCACrrGAACCTGCGCGTGTCCATTCAGAACCTGGTAAAAGTCTACCGAGATGGGATG 

AAGGTGGCTGTCGATSGCCTGGCACTGAATTTTTATGAGGGCCAGATCACCTCCT^CCTGGGCCACAATGGAGCGGGGAA 

GACGACCACCATgcaagaagagggrctggtccccgcagaatcacccacaggagggtcccgcagtagagttagaaattriai: 

accttaggaaaccacgctgatccstsgcccaagggaagcagcacacgagcagttgccgaatgcgaacatigttatctaatc 

atg3Qtgt:ctttccacgtgctagcs=sct:aga:;cttatct:oc;:c3sacT:aadacaagctgggg<;ctcagatgacct.t:xcc 

catgtagttcacagaattictgcagcegtcttggaacctgcacccacgaaaagacagattacatatgttggagggacrzgg 

taatccccaggaacrccgtctctaaccagatg^gacaagcaccicuQagacgcaatcaagctgggcagctggcttgarcg 

ccttccctgcgaccticaagsaccctacagtsggtagtatcaegagccgccacgggctgtaaagcaccagcgttagcccca 

gtggctcccagcacgaticcc'tcaaccactctiaaccattccaaacggtacatcrttgcggggcgacattctttccctCwtt; 

CcttczcaatcttctCttaaaacaragaatraacacattatgaacctCtcagaagacttctaaaaggcagtcagaaaicc 

tactacctaacacaaaaattgtictctiaiicttrgaataawatgtticccgtittrgtccatttcccatgcatgcgatgttiaggc 

atacaaaatacatctttcaaagaacactttca-cgcaaattrggaaaccccgctraaaaaacgctcatactaaaattggca 

Ct:tctaacccataggcccact:tgtagttatt-accgaagcaaaacgacagcttt:gctCtgt:gtg3gtci:ggtaggc::wca 

tT:acaaaGgaatcggggcggcggcagggx:tc=tgiit:ctgttct:ccct:ccagactgaatcgagcat:ct,agagt:taacgg 

gStcaaccctgacctctgtac^tc-aaatt:t:-cctc::ccagGTCAATCCTGACCGGGTTGTTCCCc:cCGACCT 

CGCCTACATCCTGOGAAAAGACArrCGCTCTGAGATGAGCACCArCCGGCAGAACCTGGGGGTCTGrCCCCAGCATAACG 

TGCTGTTrrGACATgtgagx:acca9cagcac9Ctaagaai:aggcct::ttcrs9acgrgt.gC3tgtcatgccatcacrgGag 

gaccgggact.::aagcat:ttt:actct:gct:gt:grcttt:gt:!:tsrcctccrt:ttccttttt:acttttt:tgagatggacrcccg 

ctcrgragccaggcrcgac*gtactggcscgatc::s5ccccactcc5£cct::ggcccccca9gtccaagcgactcz 

ccccagcctcccgagtagcrgggactCwaggcacacaccaccatgcc^acctaatter^igCgtrtitagcagagacgggg 

tttcaccacgtrgsccagga-gg^ct:caarg=c:£:t:gaccccgtgacccccccacctcgctct:cccaaagtgctgsgaaca 

cacgcatgagccaCwgtgcctiCSccacai;r":iacrttcutt;gaacatggcacgctcacctc:cgtgaacacctCgagaccT; 

agt^gt,tctttgat:ttt:agcaga5gtggsaggt:gaatggt.tgagccgtagaggtgscatcagcccagccagtggaz3ggft 

gctrgggaaacattgctt:c!:caCiactgt:caT:gctggagggcccrttaccccatcct:ctccccccgccaccct:ccct.att: 

gagccctggagcagacrccccagacctgg^agtigctticagsgccctggratgatggacctatatttgctgcttaacacat 

ttgctLCCcactcagsttgtcccatcaccca-aacgccGccagggagcccgtgcgatggagcagagaga^acctgagccct 

gcaarccTigggcaaggccrzzccct:cat:ct-cctt:ct:tatctaaagt:caacagctagggctcatct:gctccctcczc 

taaagcgaacacatggggc'^caigtgcacggtcctccccgcttitcagagcctgacgt.cccctgaggctcagga.acgccac 

tccaggtgagtgccgagCTigaccrctrgc-ggacgtgc^igtgcggacagcccattaaacaccacaccctggggccrtsa 
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actgaaaettctaactccctggtgcatcetggccacccczgctgcaaacaccctigcaagccarccgtcaccttrcactt:: 

;at;ctcct9acca3ctcatgt.ggttgcrcact9t-3tti:tacctw3aat:ctccaacattat::tt:t:cagaaattgat:aaa9 

r-actctaaaaagcaccgggagagaaaaatatgcscactctcatrrgtsctgccccaggggacactgtattctsgggtat 

SCagCaccgcccagagcrgacctgcCtCCCtgtCSCCacGCTGACTGTCGAAGAACACArCTSGTTCTATGCCCGCTTGA 

iAGGGCTC7CTGAGAAGCACGTGAAGGC3GAGA7CGAGCAGATSGCCCTGCAl'GTTGGTTTGCCATCAAGCAAGCTGAAA 

AGCAAAACAAGCCAGCTGTCAGgtgcgscccaga9ctaccrt:ccccacccc::ctcccctccr=ctccggccacacacatg 

cgoaggaaaa'tcagcactcccccagccicccagcctccccgcgctcgstaacagaaactcguccctggctgtgcccct 

r-cctictgcctccactcactgcctgggcccggtcctgcaacttczcctgctcrgttctcccccagGTGGAATGCAGAGAA 

^GCTATCTGTGGCCTTGCCCTTTGTCGGGGGATCTAAGCTTGTCATTCTGGATGAACCCACAGCTGGTGTGGACCCTTAC 

TCCCGCAGGGGAATATGGGAGCTGCTGCTGAAATACCGACAAGccgcctgacgcgcatttacrctgagiraaacsgactisa 

gacagaccgcggggcttttgagaagtgtggctgratctcatgcczaggctcctgcsaagccatgggatactcctctgtta 

kcacagaagagataaagggcattgagactgagaszcccgagagcacatgctatgcctttattcatctcctcgtccccaac 

acggticactaaatttatggttaottgaaaggctagatgcttaaatgaatggaagccgagaggggcaggaagacgactgg 

accctctggttagagacctgatgtggt,acagta:;5accaccacacgcagcct.wggasccaact:ctggcttggccccgaga 

caccgggaaagtcacaacttgcctcacct:tccr:rgcccai:aacaatactgct:9cgT:i:acctcacagacgattaaat.taa.a 

-gagaa^ccacacaaaccacctagcacaatgccrggcatat:acc3agttcccaaanaaaatgcgtact;gttcttacctct: 

gtcaggacgtrggtacctatatatacaaagctcccccacrccaeggstcacagccatacagcccgaaaggtcgctitecagg 

cctcttrcaatgcrtacccctgccaatacctccccagwcccte^cac-gcgacaaatcagaaccgagaggcctcacctg^ 

cccacatlcctCgtgttCgtgcCtggcacSCCGCACCXTTATTCrCTCTACACACCACATGGATGAAGCGGACGTCCTGGG 

GGACAGGATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGCCCTCCTCCCTGTTTCTGAAGAACCAGCTGGGAACAG 

GCTACTACCTGACCTTGGTCAAGAAAGATGTGCAATCCTCCCTCAGTTCCTGCAGAAACAGTAGTAGCACTGTCTCATAC 

CTGAAAAAGctgagctgcagtctitggagctggcccgct9ttig9ccccgggcacccaggaccCgctgget.gtgaacgatt:t 

c^iccatctLccaccccttttgccatgccgaaaccaccatc^cccrgctcccctaccccttccaaatcatatcatacttaag 

pcatggaaagctaaggggccctccgctcccattgtgccagttczgtrgaatccccttittccrtttcctatgaggcacaria 

Sagt:gat,ggagaaggtcct;c.agaggacattatt:atGt:=aaaGaaaaeagact:tgt:caagacct:aagagcctt:ggct:acaa 

acgaccrg5tcgttacwscccattactcttca5-cctcac.zaaccwtaac;:t::taaactacaaaacagccaatact:catt;a 

gecac"gatttcacgccagagacactctgggca::::9aaacaaac-aacgataacaattaatc:t:c.aT:at:agcgi:tgtt:acc 

atttcaacctcttCttcttttCaacctictatcatctcaaztaaag 
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SEQ ID NO: 22 



Jcnomic ccr.cig ccr.tainging ABCl excr.s =3 co 23: 



--aaacacaca-caaaccataaaaaacataaactagacatstagccagacaaaagcct-gctgagatggttcgc^ 
::cattgcac«rtca!'ggcaggcca« 

-Sacc"ct??a^;;gcatagaaallggt|aagaatcttttaaga 
^.agggaacctacctta^aa^ y- - - ^^^^^^^^^^^^ ^g^^^ 

aaggggatataaaagctgrccggcca^y ^ ^ 3 ^^^^ 

r.r=";gra"iis"rc^^^^^^^^^^^^^ 

ccgccac=atatttc=agtscrc«=«=«==.g=^ 
gatttasasP"«SWCtga»agaagc=ac.^« 

!a»a""^^c"a"gcra« 

^^^^^^^^^^^ 

^^^^^^^^^^ 

cctcraaagacataaasatcgcgcctcaat:s=cccgaacccBcgt:atTi.»nta^..ttc^ 
aaacacaaaaaactascccaccgtggr5gc^catgc,,gt:catcccagc^ac y 
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"=caccaacaQaaaae=ttccagt.tc:itcacccct5cecaaacagtgta«sge=c(:tacaEtatacauaac^S=ci:-a 

:;„t«cc;ctgt:3«ttctcLara=gtcaagccacst:acatggccts==tcatagagt:ascc«taa9aaatt3^ 

icwaaltcacccgfacggctggctgtaacacctggccacgcatsaacacctettttcagtt:^ 

"tcla;«""«'allagi«tla»asgcasai^ 

;;c"S"=;tgaaaoce=«lact«-.glgaaict=c=agctt=agacaca.gccttcaaga^ 
.Ma=««agMt:OTcaa=gccgaaaacacctaasaraacaggtgacct:castgcCggcStcttgcag»ATCC*^^ 

°*^;;^eSA*GAGACTSCTJU^TTSCCAGACSGRGTCGGJUL*GOATTTTTTGCTC*OgtsagacgtgCtgt«C 
MagI^«gg<:ttcataggtssgccscagg===zscgaccas=gaaggcacsaLagcatc=t^^^^ 

=|q!=c=ac!l"at:cti?ac««a==c=a=trcca=tcct£ 

«lcl"c!cctca=^icaaaat«c«taa-aata«accccca^ 

-«"a;«Ig=goc«gacattaitaagt:scztas;:=tcgugtgCgca:ats=" 

iMocalaniwtacaLaaccggt-.gggtSCattcasctgicaEaacatsagaSttg^atgcccasatacacttsac 

at««a"tI"a«aacacgal«tSS£ct5asttgacg 

'c™l"acagBga;«ctcg?casae=attg«tis=atgt:ca= 

a-l?«--"cc?a«aaa«aa=aaa=ata5acaaaassccrgcagasagcc5tt:cacc^^^ 
a;^c««ac«gea«acacaia==rcaa«=gaa«=tsact«gc 

cl«!l-«l"tlat«tcactta=«tt3C«st:=saEgarsaccat=3a«acacaaccacaaataat^ 

laaa-!arc«calnanaataattac:ataca=aat;attcacccaatg« 

!a!a;"ao"gca«att^ga;cc^.;aag«^at:aagsaacacac«c=ca 

«"ca!lll«acccaU?catttr-.t«ca5agaai«at=gtacaaagcctSagg 

-Cta-tc-J«tga«cJt:caegaaagataa-.oac«cgg«gt==ats 

^="«cltg|«ac;l«gSiccasa;alacaaccaggaCtcagsa 
'=n""««ca=t:ot=«igas;ag3aa=5Scassacc=agtsttcraagt=acccsg 

acacggcggaac 
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genomic ccntig ccntaing ABCl exon 23: 

gcaaaccattcaaaaaaaaaaadgcata^ataCacataLatata'acatatistiaacacgaatcggcctcccttitictccaa 
gcccacatctccctcccacatagcccacgcc tact t tat -tCE sect tncccgcccccgaccctgtattgccccc age tg 
cggaacataccacgtctctgtgacctgtgcccgctacctccgcgctttctacticgtgcatgcaaagagtacaaagttttc 
ctgccct:ttcctggaaaat:cctgcttgcct:gCQccaaaggaat.aactgtgaaagcacttrtcaaatncttaatgagt:t:ca 
czecc'r caaati:aaaaaaaacat:araaacgtatacgt9t:at:g:i3cat9t9*gt:acacacac3Cacctutatacat:acag 
cccacccaaaacaagctccactrtgcactgctctacgccacccccatgccgaacacagggccagagtctigagarccttct 
ggcngccttitgtgtcttgttcatttrctgttttaagaccctgtcacacagaaatacttcctaaaatgtttaattcacaaa 
aacatrcttatctctcgattactggttztaatgaatcactaagccoscngcc^crtcatgtacccacagCAATCATGCTCC 

TGASGACXCGSGAACCCTGGAACrCTTAAACGCCCTCACCAAAGACCCTGGCTTCCSGGACCCGCTGTATGCAAGCAAACC 
CAATCCCgtgagtgccactttagccataagcacgccttcctgtgcttcttgcctgctttgatttctaatatgctgcattt 
atcaacTrgcatgccacattgtgaccgccagcattitgccccCTigaattactatcatettttactcacaaaoagcsaageta 
gtaaccgaact:aaactat:c^aggaacaaacgcttggacaat:cctct:aacaccgyscaaagcacgtcat&acagacat:t:;:g 
cccac'cgattcagaaccttaarattcaat:t:taaa^acccact:ccacacctaccg3tgaaatgctt:ttcctttcet:^tctct 
cccagcccc-gtacctaagtgcttcaatiaggctc-catTiacacatgatietttaggttttgcttatcagctrtctticgcctt 
^ataatcccaaaagaCcgcatatgaacctititataaaaagcgacaccttictrccLCtcaaaCti^tatatttttatitigtiact: 
tttcctccaaaaccceczctcaaaaagcaagcag^^cacaaatiaaatccagccaaccatccatatgacccttaagtgactig 
t:asgcsaaccgaggtcaccagat;cact:gT;gagt:caacatggt;gGagaggtgaGcatcttaCgaggccgtgctcaaggc::g 
acagacgccggttagtgtttccaactttaggcacaatcticagctsagcccatgaaacaacagcgatctctgaaaaattat 
ggcaaggcgcgaaggtgctggagaattggagagcggccaaacttgactCtcaagtrtcaatgggaagat.aggtgact.c'Cg 
cacaccacacaacagtcagcacgacaacct:gtt:r.atacaaggctct:aoagcagatetctaaat:ggatagctactgt:g-::9c 
ttgttrcttcrtaattagtatrggatagttactaaatacttgttagcaccragtacataatgggtggnaaatcctageag 
ctaacaccggi:tcccaaat;aaccagatgacaaggatagagaaggacacagacaCGgcctatctiggatttcaC3gt.gcctt 
tgatCCtccacatgaacgctgtgtagggaagatacaagcatrgaeatgagatgataatatacttatctggattcatcactg 
gccagctgaaccatatgaacccatggaccgatcccagcttaggaaggcTicticcacgagccagaactgggccgagagccag 
cccat:agagacaaaacaggcccggccccgacatcaga«gent:caaacacga::gtctgagccccacctacagtccgccgga 
ggtggcnggaaggaacagcctttatccttacaarticttiactgaaactcaaattcctaggtcctgcaaaaaaatggtsgac 
ciigaasgaaacttgacaggagcacgCcLcagctgtiacctaaatcrgticccagccaatccccttttgaatgtitcagasrgt. 
aagcttcacgagggcagcgcgccctagcgcgacztrttctggtcagttcaggcgctttaaggagacaattagagatcaaiic 
tggaaaacttcatttgaatitit^taatiacatiaacaaaacaatiaacaaatagctaaaaatacatatttatataaratzatata 
cgtgtctgtgcgtgtgtgtgtgtgtgcgtgtacacatatacatattttatctatttatttttttttgagatggagtcticg 
ctctgtwgcccaggctggacttgcagcggct:caatctrggcccact:gccacctctgcctcccaggt.Ccaagtgatitcc,ccT: 
acctcagcctcctgagtagctcggatcacaagcacgtgccaccacactiggctaa 
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Genomic ccntig ccticaining AfiCl excr.a 3 0 and 31: 

tcttsccagtccctacscatttttcaccaGatcsagcataagatccaaactctt-cccaegcctctcccacctggcCcct 
cccctcctcctttatcaCCacccttcttcgcaccttatccwactiGcacccaccctigtcttcctattatCcccaaaaarta 
csaatacacttcrtcccaggacctttccacctccac^tcccatrgcttttcctcacaatcttctttttgccaagcttttg 
cccagcttgttictccascatcgccatgtttcgccrgaaacgtctsctcccagtacccccatcciccccagtcactgtctt 
cctattttcctccattctgggccatctaaggctarctcactagcGcatttcccgt-cgtcticctccacgggcacacacct 
ccatgaacgcaggtatcctcaccttacsccctccaatatacngaacagcatictagcacgtactagacgctcaacgaatct 
tcgttscgtgagcaaacggttggtrgactagatrgaactgagctcagtatgtaaacatttagggcctctttgcattctiat 
Cttactcacstataaaatgat:acataat9atgat.ataaatgacgccacagtgtacaaggctsttgtgggatcaagcaat:c 
aaatgagatcatgctcgtcttttccaaatggtgaccgaatagacccatgtczgcggttgctacggaatgatcctgtgctic 
ctgaegcaacagaaacgceaggccatctctggtaaccctactcticgctgtctrccctctgcagAGACACGCCCTGCCAGG 

CAGGGGAGCSAAGAGTGGACCACTGCCCCAGTTCCCCAGACCATCATGGACCTCrTCCAGAATGGaAACTGGACAATGC^ 

AACCCTTCACCTGCAXGCCAGTGTAGCAGCGACAAAATCAAGAAGATCCTGCCTGTGTGrCCCCCAGGGGCAGGGGGGCT 

CCCTCCTCCACAAgtcagtcaecttcagggggtgacceggcagaaggogtgcaggacgggctiggtagcttccgcttgsaa 

gcaggaatgagtgagatatcacgtcgcgagggrrtgtctcagtctctttcctttctitgttrctttttctgaggcggagzc 

tcgctctgccgcccasgctggagtgct:gtggcargatcttgcct:::actgcaacctccacct:cccaggttcaagcgatt:ct 

cctgcctcaccccccrgagcagccgccattacacgcacgcaccaccatgtctggccaacttctgtgtttttagtasasac 

agggttccgccgtgttggctaggcugenccggaatccctgaccrcaggrcacccacccgccrcggccccccaaagtgciig 

ggatnacagccgtgacccactacgcccagcccrgttccagtictiztaactccctitcttgtcataagaaaaagcatgtgagt 

tttgacgggagaaggtttggaccacaccgtgcczatcccrgtcccacagcagcaaagtcacaggacagaccgtggcagcc 

ctogcttccaatcttagctctGcaacaaacgacccsgcageccrcgacagcccrgggcccgrrtctccacctctgaacta 

SSgaggctggaccagaaaactcctgtggatcttigtcaactctggraCTicnTiagagactctgcttgggaaggagtcctgag 

ccatccctcttttcctgagaatittcaggaagaggagtgcttatgatagctctctgccgcttttatcagcaaccaaactge 

aggatgaggacaagcaaCtctaaatgagnacaggaaccaaaagaaggcttggttaccactcttgaaaataatagctagcc 

caggtgcggggcggctcacacctgtaacctcagtiattttgggatgccgagecggaccgatcacctaaggtcaggagticg 

aaaccagci;cggccaatgtggcgaaaccctgt:c::ctacsaaaaacccaaaaat:nagccaggcatggtggcacatgcG=st 

aatcccagttaGttgggaggccgaagcaggagaactgcccgaacctgggacgtgsagctcgcagggagccaaaactgcgc 

cactataccccagcctgagcaacacagcaaaacticcatatcaaaaaacaaaacgaacaaaataacagctaatctagcciat 

cagcacaactccagtgaaeagaagatttattacgcatagcgaargatggtigcctcctaaaaatctctitgaccacaaacaa 

tctcacttcaacgtttattgtttagacgttcagaataaactct-gggaaagaccttggcttggtgtaagtgaattaccag 

tgcceagggcagggtgaaccaagtctcagtgcrggctgactgagggcagtgtctgcgacctgtagtcaggtttccggtica 

caetgtaaacatggtcactgt:cgtcctTigattzgttt:tctgtt:tcaattct:t:gtctataaagacccgtatgcttggt-rt 

ca t gt: ga Cgacag AGAAAACAAAACACTGCAGAr ATCCTTCAGGACCTGACAGGAAGAAACAXTTCGGATTATCTGGTGA 

AGACGTATGTGCAGATCATAGCCAAAAGgtgac-tt-ctactaaacttiggcccccgccttattattactaattagaggaat 
taaagacctacaaataacagactgaaacagegggggaaattgccagattacggcctgatcctgtctattggaagtttacga 
tattaticccaaaccagaaaagacgacgagaggcactgticaacatccagctigtcagcztcaaggccgagscascctgg'tict 
agaatgaaaatagaaacggatticaacgtcaaatztcgccac 
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SEQ IK i\0: 25 
Genomic contig containing ABCl excn 32 

gcacgccggagtgacag&gaccatgastttctaagaaagaagcataatttctccacatgtcatccacaattgaaatatta 
ttgctaattgaaaaa9ct:tctaggccacgcacgctggctcacgcctgt;aatcccagcacttcaggagccaaggc9sgt9g 
atcactcgaggtcaggagtrcgagaccagcctgcccaacatggsS^^^t^^^ccS^^^^^^^^^'^^^^^^^^^A^^A^SCtg 
agcgtcctggtgcgtgcctgtaatcccagctactitigggaggctoaggcagsagaactgcttgaatctgggaggcggaggt: 
cgcagtgagctgagtteatgccattgcactccaccctgggcaacaagagcgaaaccatcrcccaaaagaaaaaaaaaaga 
aagaaaaagctitctagcctggttacatcttggtctataaggtgcrttgtaaaccggtttaacccaaggcctiggttcccat 
ataag^aatagggtacctat:gatggagagaagcccggaagaggcctgaacacaggcttcttt;tctctagcacaacectac 
aaggccagctgattctagggttatttctgtccc^tccttatatcctcaggtggacacttactccttctgcatcattagga 
a t agg c t c ag tg c 1 1 1 c 1 1 1 gaa c t g a 1 1 1 1 1 1 cr 1 1 c 1 1 1 g t c c c 1 9 c a g CTTAAAGAACAASATCTSGCTGAATGAGT 
TTAGgtaagttgctgtetttctggcacgcttasctcagggggagcatggtgccgtaggcgtgcttggattgaagaaagcc 
ttgggsattgtttgtcactcacacacttgtggc^gccatcccactgtgagaa 
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jenomic ccr.tig ccntaining ABCl excns 33 co 36: 

ccctcacacactttctccctagagcatcatggcccagtccGcaccaaccccticcacaagcctccgaatatctcaCacact 
gacttccttcaggagaatcagaaacctcctgcagcrgtctagccacttcaagtaagtagtgttgtgaggggaatacctac 
ccgtactscccccccaaaccagatjtcccgaggccticttaaggacccaaggacaacctctaggcattcagcacgggactaa 
aaaggtc^tacaggaaataacaagcgecaaaaccatcrctttgcactgtatttcaacccatttgtccttctgggttctga 
aggaacaggtiggactgcggacagaagagttctcgaagccagcttgtccaccatsgaaaatgagataggtgatgtggcta 
cgccaggccgcccgaaggctccttgiitactgactrccgtctctcctc^ictisccctttccccaagggccaggacccctgca 
tctctgcccagagcagacgcaggcccctataataccccticatgctagaaaggagccggagcctgtgtiataaggccagccc 
agcctactcrcgacaocgcagggctcccactctcccaactccccatctgcncgcctccagaccGacattcacacacgaec 
cactgggtccgaggagcacctgcgagatgaaacaccattcttccctcaacgtLcccagctatccaactgcgtgtgtaatca 
ggccacctcctccctecngggcagaaaccatgggagccaaaagazzgccaacat'Stattagaggaagctgacgtigl^aacr 
tccgaggcaaaatttagccctccttcgaacaggaatctgactcaccgaaccctccacacactcgcaccgagtctigc^gct 
gacganaccccgcaccccaccgcctogsctttaatigccaggctc-cccctcagGTATGCCGOCTTTTCCCTSGGTCTCAG 
XAATACTCAAOCACtrTCCTCCGAGTCAAGAAGTTAATGATGCCATCAAACAAArGAAGAAACACCTAAAGCTGGCCAAGg 
taaaatatctatcgtaagatgtatcagaaaaatccgcacgcaccrgctggoatataggagcagttggcaggttaaacgga 
tcacctcccacctcattgCtctgaatacgttgccatacagagccgtccctggcatttagcgatttgagccagacaaaact 
gaatcacctagttgtacgtctaaaaetgcaggtcaaaaacaaaticcagaggccaggagccgtggcccatgcctgtaaticc 
tagcacttcgcgaggctgaagcgggtggatcaczrgacgncagcagttcgagaccagcccggcctacatgacaaaacccc 
ccatct:actaaaaatacaaaaaaattagctggcczt:gct:.ggcacacacctctaatcccagct:act:tgggaggct:gaasca 
cgagaattgcttgaaccctgtaggaagaggttgcagticagccaacatcgcaccgc-gcactccagcctgggcaacaaGag 
caaaacTiccatctcaaaaaacaaattaaatccagagatttaaaacctcteagaogctgggcgcggtggcctacacctatt 
accccagcaccetrgggacgcceaggcgggcaaaccacaaggccacgagiictgagaccagcctggccaacatagi^gaaacc 
ccgtctctgctaaaaacatagaaaaactagccccgcatggtgccccgcgcctgtaaticccagctactcgggacgctgacg 
tgaaagaattrcUtgaacecgcgaggcggagg'C"gcactgagcccagattgcaccactgcactccagcctgggcga<;aca 
gcaagactccatcticaaaaaaagctcticagaac&accaggcttacaaaTittggtcagtcggraaaraaaccgggtrrtcaa 
acaracttccccgaaayaatcaccgactaaatacgaaatgaatctttitcttictccttttttaagctggcaagctggtccg 
taggaccrgataagtactcacrzcactitctccccgtCTicaggt^zcccatrcttaggcgacaattaaggggctGtgacaa 
aacagaccctaggattgtggacagcagcgatac-cctagagtccacaagtctgctttcgaacgatgggcccacgtatctrg 
gcacatctigcaggcagagcgtggticcrgsctctccagatgatgccgetggagcactttgaggagticctcaccccaccctig 
ataaccagacattaaaatcctggggcnttgca-cccaggacttctctgtgatcccttctagacttgtggcatcatggcag 
catcacrcgctgragacttctagccacttggttc-caggagccgrzcattTiaatggctCcacatttaatttcagtgaacaa 
ggtagtggcaccgctcttcacagggccgtcccgiicgtccacaggttccaGattgactgttgccccttatctatgtgaaca 
gCcacaactgaggcaggtttctgttgtttacacGACAGTTCTGCAGATCGATTTCTCAACAGCn^GGGAACSATTTATGAC 
AGGACTGGACACCAGAAATAATGTCAACgtaaaccgctgtcttrgttcragtagcttCttgatgaacaacaacccttacg 
tttcctcgagcactctcaactcacggcaaagt-ggcaggagcarccacaacagaaaagagcaaactattaaccttaccag 
tgaggcagtac3gtgtagtgtagt:gat:tcagacaatccgct.ctgccaccagacataccaggtaaccttgactaagtt:aet 
taacctat:ctaaaccccagttycct:catctgtgaaatggagacag.taatcatagccatttccaaactgttgtgagaa;:rc 
aatgagt taaaggtacaaggtccccaccacagcgcccgcccacacagccagcgatcactatg'Ccctgaacactgtiaatra 
cctcgccatactctctgatcacagtgtettgcc^cggtacctg&ctagaattrctttctgaggtttatgggcatggtir^gg 
tgggnargcacctgcctgcaggaccccggtttggacgcattacactgtacctggcacgttttcttccagGTGTOaTTCAA 

TAACAAGOGCTGGCfcTGCAATCAGCTCTTTCCTGAATGTCRTCAACAATGCCATTCTCCGGaCCAACCTGCAAAAGTO 
AGAACCCTAGGCATTATGGAArTACTGCTTTCAATCATCCCCrGAATCTCACCAAGCASCAGCTCTCAGAGGTGeeTCT^ 

taagtgtggctgtgtctgtacagatggagtgcggcaagcgacagggttacggagaacgggagaaaaatg'tsaatctcatt 

gtaggggaacagctgcacacaccgttatattacgataaacccggattgatccaggctctgggcagaagtgataagtitrac 

gaactcgctggttgggcttctcgaactgcacaagagaaaatsacaccgacacgtaaaaatcgt:aacat:ctagt;gaat:r.ca 

tacaaactgagttcaaaaatrgttaattaaattataatttaaticataagcecrztaatcactlitgatttgtttaaaaacca 

ctccc-icaaatctggtggaaracgtccucac-agcttgtiatic^iwCaatticccaaatcaagctgtgtgcgcgtgtgcg-gt: 

gtgcgcgtgtgtgtgtgcgtiszgtigiigtgccaagtc taaacc=agGa=gagctagt;ttaaacl:aticcagcctttgGag 

anacagatctgggt;T;cgaatctggcctctaaacrt:catacat:gi:acgatatt:aaatgaggcagt:tcatgtaaattgcc:aa 

gc^rcagcacccaccacagagiiwcatattticacacacactiaca-acctttcccgtatgtggagcatggcagttcccgctitc 

cgcct:*actcci:acaggata.ctaat:acaggacact:aggatct:"::i:acaccaagaccccatgtaat:gggctt.atgagaccat: 

tcctcctataaaaatccgacagaacctttgtaTigcgttagaccaatiaggctgcatiactgttattCtcaagttgattiraca 

gccagaaaractaatttattrgagtagttacagagcaataccrctgctcteatttagttttcaagccccactagtccctt 

gtgtgtgaaaattcacaactraccgctettacaaggtcacgaacagcggaccaaagtgaatgccatcaaccactcrgacc 

tccttcattagtt:ttattgt:gacagt:ggacw:;t:tc=.gacGtcact:aat;accagtttggcatttacatt:gtcatat:trtt:a 

gact:£aaaaatgaCcat:ctcaaGcCtgaataaaatgt:gccccgx:gaacagacgtti:t:tccttggct:gt:gcctca9aKatc 
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cctgcgtstatgtaccftgtgtgtttgtctgtgtctccatgtcctcactgaitgacccctaactgcatcaaagacccctca 
ga ttt tcacacgcC tt C t etc tCCagCATGACCACATCAGTGGAXGTCCrTGrGTCCATCTCTGTCJ^TCTTTCCAATGTC 
CTTGGTCCCAGCCASCTTTGTCGTATTCCTGATCCAGGAGCGCGTCAGCAAAGCAJUU^CACCTGCACrrrCATC^ 

TGAAGCcrGTCATCTACTCGCTCTCTAATTTTGTCTCGGATATGgcaagaacacaggcccgctgtatctctctigatgtct 
gtcaggoccatggattgatatggataagaaagaaagagctctccctatcaticaggaaatgtitccagctaccccaaag^cg 
catgaaaaag^aatagccagaggcagecgaccactttcatgacaccaaacacageattgggtaccagagtccatgtcaca 
ccagagggaaaattctgtiacacaatgatigaaaactaataccactaccacttaagctcctacgtgacaactttcccaagaa 
tcagagagatacaagtcaaaactccaagtcaatLgcc^iccaactccrccgategcctttaacctccagagccagaatgttc 
tttgcctttactaggaaagccatctgtcatttacaaaactctgtacattttatcagcagcttatccatccattgcaaatat 
tgtttttgtgccasccacaacatatcgcttctatttcgaccaacatgggggatttgaaggaactctigaagiitctaattat 
atttcaactctactttacaatatictccct^gaaacatacc^ccctgtiaactitetLactaattataagctLacacagagcaaat 
ctaattcctctcccaccgaacaagtccccggacatttaaaaataac&cccacacccccattcaacctgagtattacccag 
ataagatgatacatgagaatacacctcgtaaccticcaaagcactgtiacaaatgtgagcaatgatggtggagatgatgacg 
agatctctgctgtttataccaagccccttagaccgtgtcactcttctgatccggticgtccttgtiatggccatgctgtiata 
ctgcgaatg&cccgtttccaaaagcaaagccaagaartaacctigtgttcaogccgtggectgaatggttatgggtc^ag 
agggagttgatctrtagctcacactrctattactgcagcacaaagattttgcaccttggaaggagcaccgtcttactcgc 
aacctagtggtaaaccaaaacctccacttcacacaaacgattctgaaatticccctctccttcattctatacaaattcatt 
cgattttttcgaaactaaactttatacctatccatactaaattacatggettctatttttgttttatcttgattcagxiaa 
ttactcccttcagtaaacacagactcagtgctscgtsticcgacttatgccacgcataggrcattcagagatgaaaggtca 
agtccccgaacccatctcttgtcttcctgggcattatc^igcccctGcctgctttiagagctcctgaaattcgctagaasca 
tgtctrtcacctaagttgctgataaacacatcaagtagcattggaccgaggcaeagccctgtagtctigaagctgcagttct 
Cccagcggctgacaagccccactatcacttccccgcccgtgctccgctctgccagctgtcaattctcataattgtcctiat 
cgtcaagtccttatttctgcatcttactgcw-catacactgtcaggacagacctttaaaatcatttctcagtgcgatgaaac 
aattct;gacattcacgt:t:atgagca9tcaqct:cataaat.aga-cacatg 
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Oenomic c=ntig ccntaaning e.xena 37 co -IX: 

aaat^accccaactgccaatccaccgctcagtaagcccaccgaccgtgacaccctggcttgactigctggaaacacasaaa 

gggcatgcactttataaaatcagccaaggggaaaatactrgccaaaatg^accgtcgggtaccctgattaatagtctatg 

tggcttcattaattcaeagttactCTiccaatarGCtcacctgccctttcrsgtctgacaaccgtgaaaacttgtgcgatg 

catcgtatatttgatttaggggtgaactggatctciitrgccutcactcttagTGCAATTACGTTGTCCCTGCCXCXCTGQ 

TCATTATCATCYTCATCTGCTTCCAGCACAAGT CCTATCTGTCCTCCACCAATCTSCCn'GTgCTAGCCC'rr CTA C X 1 ' I TG 

CTGTATGGcsaagtcacctctgagtcacggacccgcacagtccaraaggcatctgccccccagtgtcagaaggagggcag 

ggactctcagcagacacttatctttttgtgtctcaacacGTGGTCAATCACACCTCTCATGTACCCyiCCCTCCTTTGTST 

TCAAGATCCCCASCACAGCCTATCTGGTGCTCACCAGCGTGAACCTCrrCATTGGCATTAATGGCAGCGTGOCCACCTTT 

GTGCTGGAGCTGTTCACCGACAATstgagtcazgcagagagaacactcctgctccgatgaccatctctgggagccacagg 

acagtgrtcaattgtcatcttactccacttgtcagtgetattsacactgctgactgccttgccctgtcctcagagtctgt 

cttccctgagaaggcaaagcacctttctttctcgctgtgccttacattttgctggtcaagcctttcagtttcttttGaca 

gCttcccrtacttctiitctttrcccaacgtt^c^icttacsaacagtagcrcctctigccttccactttacacatgagacct 

9ff9cgacgcattcactcctaaggctttcaccazcacct:ctcrzggtgtttrtaccgtcatctct«agat:caatgccttt:a 

gccttgaccataaccttgaactctaatctcaaantctcactcgcctagtccatcgctccatttagatagtatacaaatac, 

cccaacctggatatgccctagctntccttccccrtieeaacttaacgcttttcttgccatccctgticacacccagticgcac' 

taccacccactcggtisgcccaagctggctcttagacrcaccccagatgcttgcdtgctgttgcagatttcccacatirca 

actgcccaticcitgccagttcttccacgcaCGcaeccctasaataaggccticctccccactccggctgtcactgccctirgt 

ccaaacacagcacacaaggccttttacagttgcacaactccccccgtccazacccaccacaccctttcccagctgtaagc 

ttcagatgagttgcctccaaccaccacgccccsg'cagQcctggciingaaacgcccCtcttccgtcacagggtetggtagt 

anatcccccgcccrccaagacttagccaaaacgcgaagcrttscttacctgccgggaggtgccccctcttttctctgcgc 

tcccagagtccttagtccacgcctccagtacaacgtracacccacctacatsgcaacttcctgcctacatacttctccrac 

tcggagcggagtcticcttcttaataattittacctctieccaccGSCtagcacagtgcatccagcgtatagccccttatitca 

gttggtagacattccgccactigttgcctrgtcsgaccacaacz^iccgaccwatttgagaagaatccctaaaattccgaca 

aaaccccgaaactcaaat:attgacccagacatgagcaa::t:t=ct::i:tcaaatgctaagggatt:ct:taat:ggattt9cziit: 

a3CCaaatct:agcccgttcccaagctttattcatCact::ct:c=aiiact.cagagcat:ct:ctccagaci:t:tccaaagaat:ag 

aatt:rt:attgccacatatcaccagccatgcc::gctgctact:zaatcggt;acccgaatt:aaaaggtccggtt'tgt:cc?tag 

agaatcaaatCtccTicttxcactcccat.aciiticagaac^'garacattcrcaggataaaccacgaatgacacccigC'ti'rcct: 

C t C CC t c ac c C t CC c C t C cct: C CC a C 1 1 1 1 1 c r 1 1 1 1 1 r t C = - = t C agAAGCrGAATAATATCAATGATATCCTG AA GTC 

CGTGTTCTTGAT^TTCCCACATTTTTGCCrGGSACGAGQGCTCATCGACATOGTGAAAAACCACGCAAXGGCTGATaCCC 

TGGAAAGGTTTGgtgagt:9aagcagt:ggct:g-aggacgct,i:caatggagacggca<:t.ctgcat:aQgccttggtaccct.ga 

actctgttttggaaagaagcaggtgactaagcacaggatgt-cccccacccccacgcccagtgacagggctcacgccaac 

acagccggttgtggcatgggtcctgcgacacaaccattsguczgtacscs - ^a-.agcattgagaaaagtgaaagggcagt 

tttgaa99t:aaggaaaat:agt:gttatttgci:t:ggacccactggctcatgccactgtctgggttggt:tagaagcact.ssaa 

aagCcaaaccataactetgagaactaggcgacsragggaatcaGaaggaaagatgcaaacuttggctcttttaggcgaacc 

aCgcgcctgcagatgaggtcat.ttattatcc-rtaeacagt^rataaaatitataatgtattacsatccttttctacctLtta 

gaat:ggttaaaaaratttctccggtagcca:ia:igauCawtarrcat:ccat.zagataatatagtcaaatgggccatg=t:ac 

tractgtt:cat:agaagaggcgct:Ctttgcaact:tgggc-acaaaggagacatgt:aaggaatttaaggaat.ggt:tacatgg 

aactagatttaatcgaatctagcggcttaac-gattcactaggatacatgctaccgaaagggcaatctgcttaaagrgct 

tcccgatatttaccactaccaaaacttagaarrcactaaaascactgactgcgaaaattacttgggtcgtttgccztittt 

aaaaggatrt:tcggcatgtct:cat;kaaaaaaagaaa'c:ac-c:agat.atct:tcagtgaagi:t:ac:aaatcgaatacacaii':iggc 

tctgaaattctgattgafcactgagtcaCaaaaagtt^tcccaaatcagacttggaaagtgatcactctettgttaCTictt 

ttttccttgtcatgggtgatagccatttg-gcztaticcgaagatccgrgaattttaagcaacataggcccaaattrtgagg 

aagggccargstrsttgatccct;ccactct:gaccggax:;t;c-gcartgtgcctactagGGGAGAArCGCrTTSTGrCACC 

ATTATCTTGGGACTTGGTGGGACGAAACCTCrTCGCCAl-GGCCGTGeAAOTGGTGGTGTTCTTCCTCATTACTGTT 

TCCAGTACAGATTCTTCATCAGGCCCAGgrgagctictitczragaacccgtggagcaccuggttgagggccacagascag 

gcgcacagggaaacactcaccaatgsgggrtgcartcaac-saactcaaaatatgtgataaaacrgatfCtcctgatgtg 

g9ca:icccgcagcsccc:;c3CCScccaccc-x=aa5accgtg5caagtaccrtti:acaat:ac£acc^:cagagact:gaat:cv 

c ::g t: c c t gaaa a a t a g 1 1 z c a a aggc t c a :i r r t c c r c c c t r c t c c c c c a ag ACCTGT AAAT SCAAAGCTATCTCCTCTG 

AATGATGAAGATGAAGATGTGAGGCGGGAAAaACAGAGAArrCTTGATGGTGGAGGCCAGAATGACATCTTAGAAATCAA 
GGAGTTGACGAAGgtgagagastacaggt'acaatagctcarctccagc-tttcccagccttatgtgccgtaaccragca 
g^:tngccgacttgcttaacaaaagggcat.gtgtccccaaaawgtacatctataccaaggttctgtcaat:t;t;t:atrt:caaa 
aacaccaCggagacttci:t:aaaaaattct;cacTigagaatt=t^.ctgt:gat:at:gaatccccattct:cgaacactr-=gttt: 
tanatgcttacatttatgncttagctat^aaaacacaccaawattgcatacctagtcaaactgagtagagagacaatiggT: 
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Gencmic concig concaining exona 42 thrcuch 43: 

tttt;aa9acacctgcaatacat:atatatgtt9aatagat:g3aaaatt:atst:39at:gai:aacgaacgacacggw::ctaaaa 

agacaggtcaaaaastaagttcactttcattttgacccrcagaatcattcagaagccagtcgccacaaacccagaccaag 

gctcttggcacatcaaatatgcctaragcttacggtsatcgacaagtcttatgccgcagtanatgtggttcatiagccctg 

CCttccacagttgcttgggagagctgtgagtcactcagCjCCtacgaatctttacattttgtctgtcgcagATATATAGAA 

aCAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTCCGCATTCCTCCTGCTGAGcC aaagacact 1 1 gt Ct a t a t Cgcgt t 

tgtccctattagctcagactatctCLacccaatcaagcaacgacgctcgctaagaggcaaaagtggattttaaaggcztc 

tgtatjttatgccagcatggagcaactagtcaccgacaagagacggaccc-gtatgtcaacagaatgatttcagagaatcc 

aatacaatttaagaaaaagcatggcgctgggcgcagtigatitcactcctcwaatcccagcactttgggaggccgaggtggg 

cggactcacgaggtcaggagattgagaccaccccggccaacacggcgaaaccccacctctactacaaatacaaaaartag 

ctgggcatagtagtgcattcctgtagtcccagctactcgggacectgacccagcaaaattgcttgaacctaggaggcgga 

ggttgcccagattgcgctgctgcactccagcccggcgacagacrgagacicatgtcaacaacaaaaacagaaaaagcacg 

cacatctaaaacatacttttrgtgatccatctgcgacggcgatigacatccaaatagttttrtaaaaatagattttctcctt 

tctggttrccgtttgtgttcttttatgccctrttgccagactaggtggtgcaatctggctagctggctttcatcactigtt 

t:ttcac3cactaac=^t:ggcct:caacctgacaact:caaacaatat:ttacaaat;acagccacacttaaaatggtccca::t:a 

tgaaatacatattt:aaatat:ct:acacgat:gt^ctaaaaccaacaaaat:aCwt:gatccct:ctctgacactt:aagaattcaa 

ggttteacgcagctacgtgttaggcccattnatiattcacgt^r'tagac^ctgcctatacaacttaaticrttcctctcca 

gTGCTlTGSGCTCCTGGGAGTTAAXGGGCCTGGAAAATCATCAACTTTCAAGATaTTAACAGGAGATACCACTGT 

GAGGAGATGCTTTCCTTAACAAAAA'rAGgtcacaaaagsaGCsccttgtatCtcgctgcaaagactttgtttttaatsta 

t:ctaaagaaataggStgttatttt:tGattacact:g5tartt:t:^agagtccataaaaatgttgaaatatagcaaagggcaa 

agaagcacataaaatcacccacgacttcaata-ctagagataaccacaarttacactccccctcagtcccattctctcct 

tttaacagctct:aLCcaggt;at:aat:ttacatacaaE.ataatwt:gctt:gtwttttaagagtat:aaCt;tagtgat:t:tttsgt. 

aaatcgagagttttgcaaccateaccacaatscagrtttagaacttttccatcaccccacatctgtcttacatacacata 

taaatgtgccatacaattgagatcacactgca^giiagaatc-aaaatcacttcttattgttaatgagtigtattatcaata 

cttcccagcgggttacatttcctaagatgtccaac-trtacartgctacaraaaatccccccatgtaeatgcacciiataat 

ttatrtaataaattccttataaatgtcggacacacracTiCtccatttttcactatgtaaatatigtccctgnatacatctt 

ttattattccctcaggaacaattcccacaaaccaaatcgccct.ct.ctaaaGagcatacaaatitgactgagccaccg'tr^iac 

gccattCttctgagactgcacaggccacaaaccaacctgatctircgggaazacagccacattttataggctccttagaTiaa 

t:gctaccctaagtac'Ctraaatatgtggggcct:ct:;tggcczct:ctt:tc::tt;i;gagacggagt;ttcacticct:actgccca 

ggctggagagcaat.ggcgcgaccttggci;cact:gcaacctic=5cctcccaggttcaagcgact:ctcctgcctcagccticc 

tgagcagctgagactacaggtgcccgccacaacgcctgcctaattttttrgtatttncagtagaaatggggtttcaccat 

gttggccagactggtaccgagctcctgacc^^aggcgacccaectgcctcagcctcccaaagttctgggattacaggcat 

gagccactgcgcccggC'Ctictctggacttattratgtgaagacatagtiacaaggcagtggctittcagagtttct.tgaccat 

gaccgttgtgggaaatacattttatacctcaacczagtatgcacacacagacatgcagacacatgcataacctaaagtitt 

cataaagcagtacctactgctactaactgtagcgcaccccgccatttczcattctaccttatactgcgtcattaaaaaag 

i^gccggtcacgacccaccaaarctatietcesaaaccactaargaacaatgacecacaattcgaacacactggacaggggc 

atagccaacaaaaccgaaaaaagcaaggaaar~aacgt:atccatgatctcctcccccgt:ctcttacatttt:t:gcagzagc 

aat:gtaaaggaat:cctaagagaacagacazr=cgggaatagcaggcctagcgctgcacaaetcct:ttcct.aggctt:gci::c 

ctagtaccaagctcctgacgcaeacagcac^iggcagcaataaceagcccatagtaaggtttgtcacagggaccggczgca 

agaaccgat:t:t:grttggtatagctgcgaggccet:ggcaccc::gt;ccac5tgt:gccticaatcctaatt:ct:gaaaaacgct.g 

accct3ggggtgct:aatt:agatacaca9agaggaargaat9c::gcca9aaggccaagt:tcat:ggcaatgccgctncggct 

Saggt5cagtcat2agtct;ggaacgt:gaaca=tsaac-zctcccacaC=tgatccctcactt:gact:5gct:ccat.agaacc 

ceaaagccaccccaccaccacat,aaattgtrzctctacgrtct:gt.gttgctcacactcaaaacttct:gggcct:t create 

tggtgcacgtgaatggrgcatratgagtgaeg-c-caggaCccggccTitaccgtcaaagccctggggcagtgtgactgagat 

tgtcggtaaagaatgtgcactggttggcazgacc-cagaaa-tctgaaatgggactgcaccTigcagactgaagtgrtcag 

agagccagggaggtgcaacgactiggcgagggraGaggcacgaaccctccccgccaggaagagccagcatcccggcggcac 

aaagccT;gtgcttccaactaacagcagat9"at" ggt:at:c:;-tgt;aawgsagaaccatactt:tacaggaacatraggcca 

gactgtctaaccagagtatcrctacctgctzaaaatc^aacTiagtcnccrtgncGtctgcagTATCTTATCAAACATCCA 

TGAAGTACATCACAACATGGGCTACTGCCCCCAGTT'TGArGCCATCACAGAGCTGTTGACTSGGAGAGAACACGTGGAGT 
TCTTTGCCCTTTTG AG AGG AGTCCCAGAG AAAG AAGTTGGC AAGgU a c t g t ggg cacctgaaagccagcctgtcticctcr 
cgcawCCtgacaatatataccttatggccc-CccacacgcarcgaccccaggccgtttcrccticatgaatigcagcagGac 
aaaatgctggttctttatatetgccttcaggstggaaacczgcaacggcggtggggcagggctgggtgggcagacaggga 
Gi:gctgct:cecaccacacgaci:cccttc'c::=ctgcrt:::ggc::cctcaccagctgtcaggtt:atgattatagaat:c:iagtc 
c-actcacrtaaaagaac£Ct.catacaT:gta*3iig-aggacagcatgacaaaai:tcccaagccagaccaaagtcaaggtgc 
^ cc:: :;a = caCCg^agGTTGGTG AGCGGGCaArT^GGAAACTGGGCCTCGTGAAGTAXGGAGAAAAAT^ 
TAGTGGAGGCAACAL^CGCAAGCTCTC-ACAGCCATGGCTTTGArCGGCSGGCCTCCTaTCSGrGTrTCTGc 
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ctgt99atggaaaact9Ctgt:tcts9CCtga9tggaaaacatoactgttcaaaa9tcctatat9t:cc»9ggct9CC9CMi. 
gattggcttgtctccccccagggacagcagagcaaccttggaaaagcagagcgaagctcctcccttggcacacactgcgg 
tggctgcaccatgcctgcagat:gcCcccaaatagaggcac6ccaAgcactt:tgtctcttagcgtgat:tgaggct:ggatat 
gtgatttgatctttctctggaacatcctttctaatcatctctgtgttcattccccgaaaatgaagagtgtggacacagct 
ttaaaatccccaaggCagcaactaggtcacagttcctcacacacggatagargaaaaacagaCcagactgggaAgtgccc 
cccgaccettcttcttctgtagatiaagagcaciigatgttattacgggaasaagcctttgaggcttttatgtattccacct 
cggtct;ggaatctgt:ttctgtaaggctaacagccgcaacatactagggtaai:ctgagt:gagctggaattaaaaaaaaaaa 
ggaatttcaccccaatcccanactgacttcaatagaggccccagacaaaaagttgttttgtat 
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Genomic ccntig containing ASCI e^Ncns 46 to i5: 

n9ccnn9ttnaaaancaaaactt:nnnnna4ac^r*aannt:tar4nccn5nnnt:ccccccagaaaaaacnaaaan9attt:ccn 

cccng95gsgncccccnantcnaaaa«f5ccccr:cctnzcCgj:c5ngao3caaagntttctttggaactttcaaT:tcttsg 

tcccccaaaacctaccactgagaatctaatcacacaaaaaagtacceagaatacctgagtttcctigcatcaataagacat 

Ctat.aacaatgaccctgCttacaaaccaatttsaaactitac'CctaattctittgactcaticaagaaacaactagaaCg^ca 

agttaaaaccraagctgctticaaagacgctcctgcatctiaaaaacaaatccaccciitgatti^tttttccccccagcaaat 

aagacc t a c 1 1 ta 1 1 craattacagGArGAACCCACCACAGGCATCWATCCCAAAGCCCCSCGGTrcrTGTGGAATTGTG 

CCCTAAGTGTTGTCAAGGAGGQGAGATCAGTAGTGCTTACATCTCATAGgtCcgtagtaaagtcttgggttCCtcactigt 

oggatgttctaactttccaagcagaatatgccarcattittgca&aaac&agaaaatiacacaaaagcaaagagtaaaacaa 

cCattacctoaaatt:acatatgcaratwct:t:acaaaaatgcaaecccagt:ataaataCtcct:ctt:tttcactt:aatat:at 

tgtaaacattatt:ccaagtcagtgcatttagetst:Gatttct:tatagctcgatagtattccattaggatatactcrt.att 

taactattcecccttttgtagacatt^ggattattcccaacrtccccacaatcgtaaacaccactacactgaacagcatc 

atccccatacccacatgtacttgtaacagaatacaartccctacgaagctggaatgctggaagtcatggcgatgttc^ica 

t gg t: t a c agaga a t c t c t c t: a a aa c c aa aac s ^ c 1 1 1: c t g 1 1 1 a c eg c acT ATGG AAG AATGTGAAGCTCTTTGCA CTA 

GGATGGCAATCATGGTCAATGGAAGGTTCAGCrGCCTTGGCAGTGTCCAGCATCTAAAAAATAGg t aa t aaaga t aa ^ 1 1 

ctttgggatagcgcccagtgagaagccttgacatictattcttt^gtgagtatacaaatggtgcctccaaaacaaagsgaa 

ataaaact:gagcaaaacagt:atiagtgsaaagaat:gacggcttcgaagtcc9aactgcat:tcaaattctgt:ct:tcaccai:t 

tactggtccratgacccttgggcaacctacttaactaccgcaacagtcagttcccctgaaagatctacctcctagcc-zrg 

tgccacagacgaaatgaaaaaaattcacatgtcccagtactgctgagagegcaaactttscagtcaaacacaaatgsctt 

tgcarcctggccctaccaatcatgagctctgacccacgggcaagtgactaactccctgggcctcagtttctctgtaacat 

ccgtcagacttcatgggtccaggtgaggattiSAaggagaccacgtattCacagcacatgccatggtgcttcacatiaaaat 

aagtactcagtaaatgatiaactggttccttCTicccagaaactzatttctgggcccgccaecggccgccettitttcatggc 

acaagttggstccccagggctcagtatccttttaaacagttrtctggagaccccccatttigggtactttttcctgccttc 

agGTTTGGAGATGGTTATACAATAGrTOTACGAATACCAOGGTCCAACCCCCACCTGAAGCCrCTCCAGGATTTCTTTOG 

ACTTGCATrrTCCyGGAAGYGTTCyAAAAGAGAAACACCgGAACATGCTACAATACCAGcrrrcCATCTTCArTAT 

TGGCCAGGATATTCAGCATCCTCTCCCAGAGCAAAAAGCGACTCCACATAGAAGACTACTCTOTTTCTC^CACAACACTr 

GACCAAgtaagcttccgagtigCcaaaacagatticacttcccaeg^cgtggattcccgccccgaczacccccgcccatacgtic 

caagagcagtctgtatcttgaactggtgcttgaatccccgatcrtactattcccagctatgccttttact.aaacctctictg 

aacctgaaaagggagatgaLgcccatg^actccatagcatitia^tgtgagaatttacligcaaCaatiaaccacaaaaaccac 

catctagtgagcacctaccacgggccaggcatrtcacttggtgcctaatcctatttaaaccagataaaaaagtaccaaat 

aggccctgacacttaagaagtactcagtaaatatttccttcectcttccctttcaatcaagaccgtatgtgccaaactaaa 

tggacgactgagcagttggtgatgtiagggstgggeggcgatacagaaagtcagtttccsgccgggcgtggcggetcacgc 

ctgtaatcccagcaccctgggaggctgaggsgcacgcagaccatgaggtcaggagatccagataattcctggccaacaggg 

tgaaaccccgtctctactaaaaatBcaaaaa::tagct:gggc:at:ggtggi:gcgcacttgi:agtcccagctacti:gcgaggc 

tgaggcaggagaactgctcgaacccaggacczggaggttacaccgagccaaggtctcgccaccgcactccagcccgggga 

cagagc:aagaccccat:ttcaaggggggaaaaa.aaetctattC"t;aagtt:gtt:att:gct:tcttt:caagtatt:ct;:t:ccct.cc 

tccacacacagttttctagttaatccatttacgtaattccgtatgctcctactrgacctaatttcaacatctggaaaaat: 

agaactagaataaagaatgagcaagtcgagrggcactcattaaaggcccatcccaatcrrttaacasGTATTTGTGAACTT 

TGCCAAGGACCAAAGTGATGATGACCACTrAAAAGACCTCrCATTACACAAAAACCASACAGTAGTGGACGTTGCAGrrrC 

TCACATCTTTTCTACAGGATGAGAAAGTGAAACSAAAGCTATGTATGAAGAAtCCTGTTCA^rACGaOGTGGCrGAAAGTAA 

AGAGGAACTAGACTTTCCTTTGCACCATGTGAASTGTTGTGGAGAAAAGAGCCAOAAGTTGATGTGGGAXGAAGTAAACT 

GGATACTGTACTGATACTATTCAATGCXA^GGAATTCAArGcaatgaaaacaaaaCtCcactacaggggcagtgcctCCg 

tagcctatgtcntgtatggccctcaagtieaaagactrgaa^TitagttctGtacctatiaccta^tgtgaaaccrcasriacgg 

aacccaatggacatatgggtccgaactcacacc-titttrtttirtctctgctcctgTigcaccctcattQgggctgcaacaa 

taactcatcaagcaatcatggecagcgat:;:5;:tcatcaaa£zcaaaaGStaatgcacai:cctc:attcactaagccatgcc 

atgcccaggagaccgguttcccggcgacacs-ccat-gcccccaatgactigtgccagagccattagcgccaagtcrccca 

gaaagtttgaagcaccatgct.gt:gtcaccczcact-czg"=aaacct:gc5ctgctcacagtct;accaacat:t9aa::atca 

gccgacagaacsgtgccatgcgcggctaacawcctccv'rsat^iccctctgataagctgttictiggtsscagtaacatgca 

acaaaaatgtgcgcgtictccacgcacggcasactttggctccactgrtatatcgtcetatgcctcgagceatggccstaca 

gggtcatccttatgagactctiraaatatac-sacatcccgrcaagacgcaaagaatcaacagccaaaccgctgcggctgc 

aactsctgaagccagggcatccgatt:aaa9agaci:aT;gcgt;tcaaacctagggaaccctgtgcccattttgtcc^eactgt: 

ctgctaacatggt:acaccgcat:ctcaaga::5t.ttauctcacacaagcgtattattt:ctggct:t:t:t:t:gaattaat:csagaa 

aacsaaaagatggacttgtattttgacaaaaatctcigtacctctcaatgttatctcgaautttaagrtctatcagtigac 

CcctgaatcctcagaaCggcctct'tgtagaaccctgtccTiatagaggagtatggccactgcccactatttttaz^ttct 

tatataagtttigcataticacrcacgaccag-gcctagaaagcaatgtigaGgctcaggatctcacgacattatatrrgagt 

CCctttcagaCcacccagga'EaccctcaatiCcactrcaccaatcaaatiaCtticttsaGtictaccctgcagctgaaagag 
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cacgtacgtacgtacaagactagagagacactaactctcagtacacttcctccgccatigtcattcagctcactss^ctiac 
aaatataggttgtcttgtggttgtaggagcccactgtaacaatactgggcaccctttttttttctttttttaattigcaac 
aatgcaaaagccaagaaagttt&agggtcacaagtccaaacaacgaactctccaacagggaaaacagctagcctgaaaac 
ttgctgaaaaacacaacttgtgtttatggcatttagtaccttcaaataattggetttgcagatactggataccccatcaa 
atctgacagtctcaaacttttcatctcttcaaccactagtcaagaaaaaatataaaaacaacaaatacttccatatggag 
cattittccagagttttctiaacceagtcttattttttctagticagtaaacacttgtaaaaacactgtttcactaatacttac 
cgtcaactatcttgagagaaaagaaaaatatga^agaactaccgtctggggaagctcaagtgatctttcaatatcattac 
taacttcttccacttttcccagaattcgaatattaacgctaaagstgcaasactccagatttcaaattaacctstctata 
ccrtttaaatrtaeagaaCattatataacccactgctaaaaaagaaacaaatigattgttttagBagtcaaagetcaaCat 
cgattttaaaauattaag 
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(.OCShmMSEQIOMo. U 



S PPRC 
BPPRE 
7 9^R£ 
a PPR6 

9 PPRS 

10 PPftE 



ISRE 

4 s«e 

iSRE 

9 5AE 
to SRE 
It SRE 

n SRS ■ 

14 sne 

15 SRE 

ta aRE 

1 ROR(f«n»0tc*Pc»fW5B0i«fatti»d) 

2 ROft(f«tinote »t!«l Mrtepwt rebriMl 

3 ROR(r«un«e led rw:«olar riMsO) 

4 ROR(fttinoiCje<lf«c«Ptarr«<»l*d| 

5 RORtrWntsie tea recepiof raiaiect) 
5 ROR4r«tine>e*oili«e«cior rmmt 
7 ROR<ftltnoic»c<»«cepi« retolwt) 

1 sneBP-tofSttOK" 

2 SREBP-lor-«oo»* 

3 SREep-f or 'S bcut' 

4 SRGBP-1 or-^btw* 

5 SBB0P-1or*Bbo«* 
e SRESP-I Of •Ebon' 
7 SP6BP-lor*SbPX- 

SSREBP-IOf'Stwx* 
to aREBP*1«*Eboit' 

11 SREBP'1 <K '5 Oox* 

12 JSR6BP*10f ■E fiw* 

13 SftSSP-lofEOOx' 

14 Stteap-lor'EMx'' 

15 SREBPO WEttOn* 

16 SftEBP-t 

tr 3R6BP*! or '6 Ben* 
IB SREBP-lOf'eaox- 
1» SRSeP*1 c3r*EbBX* 
JOSRBBP-lor-eiwx- 

2t sneap-ior-Hbo*- 

22SREeP-H»-Hi»»- 

23 SREBP't cr 'E Dm* 

24 SReaP'tot 'gbok* 
as SREBP-IOf'Ebo** 

28 SReaP*1<y"Ebc«- 
27 SBGBP.I ot'EbOK* 
23 sn6BP.1 or *e boat* 

29 SReBI*-1 WEftOx* 

30 SREBP'1 <r*6B0«* 

ai s;^BP-i *E ec^* 

32 SREBP-I w'EbCit' 

33 SREBP>1 Of "E box-* 

34 SREBP't or -Ebon' 
SSSREBP-lofEfto** 

36 SREBP-I «f "E DOm* 

37 SREBP-1 C 'E bO»* 

3B sreBP-t«r*esox* 

39 SRESPM w*eb0V 

40 SREBP-I or*EtKJ«* 

41 SREBP-i or *€ box- 
12 SREBP-1 Of •& bax* 

43 SREBP-l Of -S bOK' 

44 SREBP-I or -E bOx* 

45 SRKBP.Ior-^Mx- 



21S0-3161 
215ft>216a 
41IC-4I39 
507S-S087 

S731-d743 
7230.7233 
7S£4.7Sfia 



AGGTAAAAarCA 
AGAaTAGAGCOCA 
ATGTCAACTTCA 
AGTTCAAAAGOOCA 
AGGCCAGCACSOCC 
AGGGCAGAACTOA 
ATGCCMOOtCA 
GGG6CAAG5GTA 
ACCTAATQAfiGACA 
GQATCACOAOGTCA 



14|.«»lt 

14 CWTOwwrnt 
13 L«M 



U Co*noi«m«f« 
15 ComowTrtfti 



159.166 




8 Ltad 


11».1140 


CAGCTCAC 


8 Contpteffitm 




CACACCAC 


a Lad 


tS0»-18lS 


CAGCCCTC 


8 Cotn(»»«n«nt 


tao«.i90i 


CAGCCCAT 


0 tcad 


2sa>257a 


CAACSCAC 




U»-3310 


CAGCTCAC 


a L4W1 


347SA47T 


CCGCCCAC 


a L«iMf 


47»4-)791 


CTCCCCAC 


9 ComDtcTisnt 


4K:ae09 


CAOCCTAC 


8 Cefoptwtv'i' 


4975-1977 


CACCTCAC 


8 Contpletnem 


e4a?-54$4 


CAGCCTAC 


8 Convl«'*wtt 


6S».€573 


CACCCAAC 


8 ComolMrieftt 


6rir4734 


CACCCTAC 


8 ua4 


70JT-70i« 


CACCCAAC 


8 L«ftd 




CAOCCCTC 


8 Co<rpt««««rt 


*.«.l7t 


^OGGTCA 


7 COTipiWWH 


■6M73 


AAGGQTCA 


S Complement 


2S3.370 


ATGCGTCA 


ft tuD 


3«1.370 


TGGGTCA 


r u»d 


2£'9<222! 


TAGQGTCA 


SUM 


a:- 9-2225 


AGGQTCA 


r L«4d 


3&A3.3e49 


TGGGTCA 


7 L«M 


ceci^io 


AAGGTCA 


7 CompdfMnt 


a73U7S 


ACACCTG 


7 Cornplament 


i38-S4l 


ACACATQ 




M7.541 


TCATQTG 






TCATGTG 


7 Conv>^Bni 


925^1 


ACAcrrc 


7 Lod 


947.973 


TCACTTG 




» 9.974 


TCAAOTG 


7 ComptarrMm 


1C«3*lDfi9 


ACAGGTG 


7C(»mplem«oi 


1 -34.1110 


TCACTTG 


7 Ltad 




TCAAGTG 


7 Compltmwit 


!5S»-lSa7 


TCACTTG 


1 L«44 


I3;t|.l67* 


TCAAATG 


7 L*sd 


1T4tt.l7S* 


ACACTTG 


7 LMd 


1-49.1TSS 


ACAAGTG 




*353.I8S« 


TCATOTO 


7 Lead 


••53.1659 


ACACATG 


7 Complement 


*«99.190S 


ACAAATQ 


7 Comptemenl 


2*99.2205 


ACACGTQ 


7 U»d 


£393.3399 


ACAGCTG 


7 Comptemeni 


2««9'37005 


ACACCTG 


1 L«i<l 


ZS77.29U 


TCACATG 


7 CamplMntnt 


2?40.27-i6 


acaactg 


7 Co«TiptMin«nt 


£96 9.2979 


ACAAATG 


7 Ijtad 


I»7».29e$ 


ACACATC 


7 UM 


;»1.29«7 


ACATGTG 


7 L*»<t 


2980-2986 


acatgtg 


7 Co<"P»«nwnii 


2iaz.29Be 


ACACATCS 


7 CowBtenwnt 


24fi1-34S7 


TCAGGTCJ 


1 LMd 


sAS2-u«a 


TCACCTG 


7 COflW'w^crH 


2S47.3SS3 


tCAACTG 


7 Compicmeni 


2:s;.37£a 


ACACATG 




£228-1232 


TCACCTG 


7 Lawt 


^532.4598 


ACACGTG 


7 Comptemeni 


45B8-1594 


TCAOTTG 


7 Ltad 


*89).*«8r 


TCAGGTG 


7 Lead 


49SM9S7 


ACAAATG 


7 Uad 


S0S5.9102 


TCAAATG 


7 Ce»mol«««<M 


3912.59IB 


ACAOTTG 


r Lsad 


3913'$9I9 


TCAACTG 


r Corrtpiemttm 


5245.€2St 


ACACATG 


T CoinptenMnt 


'3288.6294 


ACAACTG 




»a3.o629 


TCATTTO 


7 l«3d 


S83S.6S42 


TCACCTG 


7 t^ad 


?837.8»3 


ACAGGTG 


r Comiitvinwit 


7ft32.703a 


ACAGGTC 


7 CompHwnani 
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40 sniBP- 

47 SRESP' 



49 jREQP* 
Sa^REBP- 

51 5R69P- 

52 SRMP' 



54 SRSBPM 



1 w-Ew' 

■1 or-esoi- 
■tOT'ESOl* 

tcr'EbaO 
■tor-ESei- 

ior*Eto«* 
(Of "Sso*' 
tor*Sbo«* 
t WE sot* 
>i Of '6 em* 
WE km* 
Of '6 ££>»* 



MC1.T107 
7139.7144 
713».7t45 

7497.7473 
7640*7«4a 
764r*7M7 

76S4.7««e 
7735.7741 
7B3«.7e44 



eost.do57 





.Md 






ACAono 


? -.CM 


TCAACTO 




ACACCTC 


7 ComotwiMni 


ACAGora 




TCATTTG 


7UnH) 


TCAAATO 




TCACTTG 




ACAACTG 




ACAAATG 




TCAGGTG 


7 C3fWlem«nl 


TCATCTG 




TCAGCTC 




TCAGCTO 


7 CO'xp'V'VV' 
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SEQUENCE LISTING 

<110> University of British Columbia 
Xenon Bioresearch, inc. 

<120> METHODS AND REAGENTS FOR MODUIiATING 
CHOIiESTEROIi LEVELS 

<130> 50110/002WO5 

cl50> 60/124,702 
<151> 1999-03-15 

<150> 60/138,048 
«;151> 1999-06-08 

<150> 60/139,600 
<151> 1999-06-17 

<150> 60/151,977 
<151> 1999-09-01 

<160> 287 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2261 
<212> PRT 

<213> Homo sapiens 



<400> 1 



Met 


Ala 


Cys 


Trp 


Pro 


Gin 


Leu 


Arg 


Leu 


Leu 


Leu 


Trp 


Lys 


Asn 


Leu 


Thr 


1 








5 










10 










15 




Phe 


Arg 


Arg 


Arg 
20 


Gin 


Thr 


Cys 


Gin 


Leu 
25 


Leu 


Leu 


Glu 


Val 


Ala 
30 


Trp 


Pro 


Leu 


Phe 


lie 
35 


Phe 


Leu 


lie 


Leu 


lie 
40 


Ser 


Val 


Arg 


Leu 


Ser 
45 


Tyr 


Pro 


Pro 


Tyr 


Glu 
50 


Gin 


Hi a 


Glu 


Cys 


His 
5S 


Phe 


Pro 


Asn 


Lys 


Ala 
60 


Met 


Pro 


Ser 


Ala 


Gly 


Thr 


Leu 


Pro 


Trp 


Val 


Gin 


Gly 


He 


He 


Cys 


ABn 


Ala 


Asn 


Asn 


Pro 


65 










70 










75 










80 


Cya 


Phe 


Arg 


Tyr 


pro 
85 


Thr 


Pro 


Gly 


Glu 


Ala 
90 


Pro 


Gly 


Val 


Val 


Gly 
95 


Asn 


Phe 


Asn 


Lys 


Ser 


He 


Val 


Ala 


Arg 


Leu 


Phe 


Ser 


Asp 


Ala 


Arg Arg 


Leu 








100 










105 










110 






Leu 


Leu 


Tyr 


Ser 


Gin 


Lys 


Asp 


Thr 


Ser 


Met 


Lys 


Asp 


Met 


Arg Lys 


Val 






115 










12 0 










125 








Leu 


Arg 

130 


Thr 


Leu 


Gin 


Gin 


lie 
135 


Lys 


Lys 


Ser 


Ser 


Ser 
140 


Asn 


Leu 


Lys 


Leu 


Gin 


Asp 


Phe 


Leu 


Val 


Asp 


Asn 


Glu 


Thr 


Phe 


Ser Gly 


Phe 


Leu 


Tyr 


His 


145 










150 










155 










160 


Asn 


Leu 


Ser 


Leu 


pro 
165 


Lys 


Ser 


Thr 


Val 


Asp 
170 


Lys 


Met 


Leu 


Arg 


Ala 
175 


Asp 
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Val He Leu His isys Val Phe Leu Gin Gly Tyr Gin Leu His Leu Thr 

180 185 " 190 

Ser Leu Cys Asn Gly Ser Lys Ser Glu Glu Met He Gin Leu Gly Asp 

195 200 205 

Gin Glu Val Ser Glu Leu Cys Gly Leu Pro Arg Glu Lys Leu Ala Ala 

210 215 220 

Ala Glu Arg Val Leu Arg Ser Asn Met Asp He Leu Lys Pro He Leu 
225 230 235 240 

Arg Thr Leu Asn Ser Thr Ser Pro Phe Pro Ser Lys Glu Leu Ala Glu 

245 25 0 255 

Ala Thr Lys Thr Leu Leu His Ser Leu Gly Thr Leu Ala Gin Glu Leu 

260 265 270 

Phe Ser Met Arg Ser Trp Ser Asp Met Arg Gin Glu Val Met Phe Leu 

275 280 285 

Thr Asn Val Asn Ser Ser Ser Ser Ser Thr Gin He Tyr Gin Ala Val 

290 295 300 

Ser Arg He Val Cys Gly His Pro Glu Gly Gly Gly Leu Lys He Lys 
305 310 315 320 

Ser Leu Asn Trp Tyr Glu Asp Asn Asn Tyr Lys Ala Leu Phe Gly Gly 

325 330 335 

Asn Gly Thr Glu Glu Asp Ala Glu Thr Phe Tyr Asp Asn Ser Thr Thr 

340 345 350 

Pro Tyr Cfys Asn Asp Leu Met Lys Asn Leu Glu Ser Ser Pro Leu Ser 

355 360 365 

Arg He He Trp Lys Ala Leu Lys Pro Leu Leu Val Gly Lys He Leu 

370 375 380' 

Tyr Thr Pro Asp Thr Pro Ala Thr Arg Gin Val Met Ala Glu Val Asn 
385 390 39S 400 

Lys Thr Phe Gin- Glu Leu Ala Val Phe His Asp Leu Glu Gly Met Trp 

405 410 415 

Glu Glu Leu Ser Pro Lys He Trp Thr Phe Met Glu Asn Ser Gin Glu 

420 425 430 

Met Asp Leu Val Arg Met Leu Leu Asp Ser Arg Asp Asn Asp His Phe 

43 5 4 40 44 5 

Trp Glu Gin Gin Leu Asp Gly Leu Asp Trp Thr Ala Gin Asp He Val 

450 455 460 

Ala Phe Leu Ala Lys His Pro Glu Asp Val Gin Ser Ser Asn Gly Ser 
465 470 475 4B0 

Val Tyr Thr Trp Arg Glu Ala Phe Asn Glu Thr Asn Gin Ala He Arg 

485 490 495 

Thr He Ser Arg Phe Met Glu Cys Val Asn Leu Asn Lys Leu Glu Pro 

500 505 510 

He Ala Thr Glu Val Trp Leu He Asn Lys Ser Met Glu Leu Leu Asp 

515 520 525 

Glu Arg Lys Phe Trp Ala Gly He Val Phe Thr Gly He Thr Pro Gly 

530 535 540 

Ser He Glu Leu Pro His His Val Lys Tyr Lya He Arg Met Asp He 
545 550 555 560 

Asp Asn Val Glu Arg Thr Asn Lys He Lya Asp Gly Tyr Trp Asp Pro 

565 570 57S 

Gly Pro Arg Ala Asp Pro Phe Glu Asp Met Arg Tyr Val Trp Gly Gly 

580 585 590 

Phe Ala Tyr Leu Gin Asp Val Val Glu Gin Ala He He Arg Val Leu 

595 600 605 

Thr Gly Thr Glu Lys Lys Thr Gly Val Tyr Met Gin Gin Met Pro Tyr 
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610 










615 










620 










Pro 


Cys 


Tyr 


Val 


Asp 


Asp 


He 


Phe 


Leu 


Arg 


Val 


Met 


Ser 


Arg 


Ser 


Met 


625 










630 










635 










640 


Pro 


Leu 


Phe 


Met 


Thr 


Leu 


Ala 


Trp 


He 


Tyr 


Ser 


Val 


Ala 


Val 


He 


He 










645 










650 










655 




Lye 


Gly 


He 


Val 


Tyr 


Glu 


Lys 


Glu 


Ala 


Arg 


Leu 


Lys 


Glu 


Thr 


Met 


Arg 








660 










665 










670 






lie 


Met 


Gly 


Leu 


Asp 


Asn 


Ser 


He 


Leu 


Trp 


Phe 


Ser 


Trp 


Phe 


He 


Ser 






675 










680 










685 








Ser 


Leu 


He 


Pro 


Leu 


Leu 


Val 


Ser 


Ala 


Gly Leu 


Leu 


Val 


Val 


He 


Leu 




690 










695 










700 










Lys 


Leu 


Gly 


Asn 


Leu 


Leu 


Pro 


Tyr 


Ser 


Asp 


Pro 


Ser 


Val 


Val 


Phe 


Val 


705 










710 










715 










72 0- 


Phe 


Leu 


Ser 


Val 


Phe 


Ala 


Val 


Val 


Thr 


He 


Leu 


Gin 


Cys 


Phe 


Leu 


He 










725 










730 










735 




Ser 


Thr 


Leu 


Phe 


Ser 


Arg 


Ala 


Asn 


Leu 


Ala 


Ala 


Ala 


Cys 


Gly 


Gly 


life 








74 0 










745 










750 






lie 


Tyr 


Phe 


Thr 


Leu 


Tyr 


Leu 


Pro 


Tyr 


Val 


Leu 


Cys 


Val 


Ala 


■ P 


Gin 






755 










760 










765 








Asp 


Tyr 


Val 


Gly 


Phe 


Thr 


Leu 


Lys 


He 


Phe 


Ala 


Ser 


Leu 


Leisi. 


Sept.Pro- 




770 










775 










780. 










val 


Ala 


Phe 


Gly 


Phe 


Gly 


Cys 


Glu 


Tyr 


Phe 


Ala 


LeuTphe *Giu-' 6!Ri "^3jja' 


785 










790 










795 












Gly 


He 


Gly 


Val 


Gin 


Trp 


Asp 


Asn 


Leu 


Phe 


Glu 


Ser 


pro »5ral 


Glu 


• Glu 










805 










810 










8i5 




Asp 


Gly 


Phe 


Ash 


Leu 


Thr 


Thr 


Ser 


Val 


Ser 


Met 


Met 


Leu 


Phe 


Asp 


Thr 








820 










825 










830 






Phe 


Leu 


Tyr 


Gly 


Val 


Met 


Thr 


Trp 


Tyr 


He 


Glu 


Ala 


Val 


Phe 


Pro 


Gly 






835 










840 










845 








Gin 


Tyr 


Gly 


He 


Pro 


Arg 


Pro 


Trp 


Tyr 


Phe 


Pro 


Cys 


Thr 


Lys 


Ser 


Tyr 




850 










855 










860 










Trp 


Phe 


Gly 


Glu 


Glu 


Ser 


Asp 


Glu 


Lys 


Ser 


His 


Pro 


Gly 


Ser 


Asn 


Gin 


865 










870 










875 










880 


Lys 


Arg 


He 


Ser 


Glu 


He 


Cys 


Met 


Glu 


Glu 


Glu 


Pro 


Thr 


His 


Leu 


Lys 










885 










890 










895 




Leu 


Gly 


Val 


Ser 


He 


Gin 


Asn 


Leu 


Val 


Lys 


Val 


Tyr 


Arg 


Asp 


Gly 


Met 








900 










90S 










910 






Ly3 


Val 


Ala 


Val 


Asp 


Gly 


Leu 


Ala 


Leu 


Asn 


Phe 


Tyr 


Glu 


Gly 


Gin 


He 






915 










920 










925 








Thr 


Ser 


Phe 


Leu 


Gly 


His 


Asn 


Gly 


Ala 


Gly Lys 


Thr 


Thr 


Thr 


Met 


Ser 




930 










935 










940 










He 


Leu 


Thr 


Gly 


Leu 


Phe 


Pro 


Pro 


Thr 


Ser Gly 


Thr 


Ala 




Leu 


945 










950 










955 










960 


Gly 


Lys 


Asp 


He 


Arg 


Ser 


Glu 


Met 


Ser 


Thr 


He 


Arg 


Gin 




Gly 










965 










970 








'*!?75 




Val 


Cys 


Pro 


Gin 


His 


Asn 


Val 


Leu 


Phe 


Asp 


Met 


Leu 


Thr 


Val 


Glu 


Glu 








980 










985 










990 






His 


He 


Trp 


Phe 


Tyr 


Ala 


Arg 


Leu 


Lys 


Gly Leu 


Ser 


Glu 


Lys 


His 


Val 






995 










1000 








1005 






Lys 


Ala 


Glu 


Met 


Glu 


Gin 


Met 


Ala 


Leu 


Asp Val 


Gly 


Leu 


Pro 


Ser 


Ser 




1010 








1015 








1020 








Lys 


Leu 


Lys 


ser 


Lys 


Thr 


Ser 


Gin 


Leu 


Ser 


Gly 


Gly Met 


Gin 


Arg 


Lys 


1025 








1030 








1035 








104 


Leu 


Ser 


Val 


Ala 


Leu 


Ala 


Phe 


Val 


Gly 


Gly Ser 


Lys 


Val 


Val 


He 


Leu 



1045 1050 1055 
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Asp Glu Pro Thr Ala Gly Val Asp Pro Tyr Ser Arg Arg Gly^Ile Trp 

1060 1065 1070 

Glu Leu Leu Leu Lys Tyr Arg Gin Gly Arg Thr He He Leu Ser Thr 

1075 1080 1085 

Hi a His Met Asp Glu Ala Asp Val Leu Gly Asp Arg He Ala He He 

1090 1095 1100 

Ser His Gly Lys Leu Cys Cys Val Gly Ser Ser Leu Phe Leu Lys Asn 
1105 1110 1115 1120 

Gin Leu Gly Thr Gly Tyr Tyr Leu Thr Leu Val Lys Lys Asp Val Glu 

1125 1130 1135 

Ser Ser Leu Ser Ser Cys Arg Asn Ser Ser Ser Thr Val Ser Tyr Leu 

1140 1145 1150 

Lys Lys Glu Asp Ser Val Ser Gin Ser Ser Ser Asp Ala Gly Leu Gly 

1155 1160 1165 

Ser Asp His Glu Ser Asp Thr Leu Thr He Asp Val Ser Ala He Ser 

1170 1175 iieo 

Asn Leu lie Arg Lys His Val Ser Glu Ala Arg Leu Val Glu Asp He 
1185 1190 1195 1200 

Gly His Glu Leu Thr Tyr Val Leu Pro Tyr Glu Ala Ala Lys Glu Gly 

1205 1210 1215 

Ala Phe Val Glu Leu Phe His Gin He Asp Asp Arg Leu Ser Asp Leu 

1220 1225 1230 

Gly He Ser Ser Tyr Gly He Ser Glu Thr Thr Leu Glu Glu He Phe 

1235 1240 1245 

Leu Lys Val Ala Glu Glu Ser Gly Val Asp Ala Glu Thr Ser Asp Gly 

1250 1255 1260 

Thr Leu Pro Ala Arg Arg Asn Arg Arg Ala Phe Gly Asp Lys Gin Ser 
1265 1270 1275 1280 

Cys Leu Arg Pro Phe Thr Glu Asp Asp Ala Ala Asp Pro Asn Asp Ser 

1285 1290 129S 

Asp He Asp Pro Glu Ser Arg Glu Thr Asp Leu Leu Ser Gly Met Asp 

1300 1305 1310 

Gly Lys Gly Ser Tyr Gin Val Lys Gly Trp Lys Leu Thr Gin Gin Gin 

1315 1320 1325 

Phe Val Ala Leu Leu Trp Lys Arg Leu Leu He Ala Arg Arg Ser Arg 

1330 1335 1340 

Lys Gly Phe Phe Ala Gin lie Val Leu Pro Ala Val Phe Val Cys He 
1345 1350 1355 1360 

Ala Leu Val Phe Ser Leu He Val Pro Pro Phe Gly Lys Tyr Pro Ser 

1365 1370 1375 

Leu Glu Leu Gin Pro Trp Met Tyr Asn Glu Gin Tyr Thr Phe Val Ser 

1380 1385 1390 

Asn Asp Ala Pro Glu Asp Thr Gly Thr Leu Glu Leu Leu Asn Ala Leu 

1395 1400 1405 

Thr Lys Asp Pro Gly Phe Gly Thr Arg cys Met Glu Gly Asn Pro He 

1410 1415 1420 

Pro Asp Thr Pro Cys Gin Ala Gly Glu Glu Glu Trp Thr Thr Ala Pro 
1425 1430 1435 1440 

Val Pro Gin Thr He Met Asp Leu Phe Gin Asn Gly Asn Trp Thr Met 

1445 1450 1455 

Gin Asn Pro Ser Pro Ala Cys Gin Cys Ser Ser Asp Lys He Lys Lys 

1460 1465 1470 

Met Leu Pro Val Cys Pro Pro Gly Ala Gly Gly Leu Pro Pro Pro Gin 

1475 1480 1485 

Arg Lys Gin Asn Thr Ala Asp He Leu Gin Asp Leu Thr Gly Arg Asn 
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1490 1495 1500 

lie Ser Asp Tyf Leu Val hy& Thr Tyr Val Gin lie lie Ala Lys ser 
1505 1510 1515 1S20 

liexL itys Asn Lys lie Trp Val Asn Glu Plie Arg Tyr Gly Gly Phe Ser 

1525 1530 1535 

Leu Gly Val Ser Asn Thr Gin Ala Leu Pro Pro Ser Gin Glu Val Asn 

1540 1545 1550 

Asp Ala lie Lys Gin Met Lys Lys His Leu Lys Leu Ala Lys Asp Ser 

1555 1560 1565 

Ser Ala Asp Arg Phe Leu Asn Ser Leu Gly Arg Phe Met Thr Gly Leu 

1570 1575 1580 

Asp Thr Arg Asn Asn Val Lys Val Trp Phe Asn Asn Lys Gly Trp His 
1585 1590 1595 1600 

Ala lie Ser Ser Phe Leu Asn Val XI e Asn Asn Ala lie Leu Arg Ala 

1605 1610 1615 

Asn Leu Gin Lys Gly Glu Asn Pro Ser His Tyr Gly lie Thr Ala Phe 

1620 1625 1S30 

Asn His Pro Leu Asn Leu Thr Lys Gin Gin Leu Ser Glu Val Ala Leu 

1635 1640 1645 

Met Thr Thr Ser Val Asp Val Leu Val Ser lie Cys Val He Phe Ala 

1650 1655 1660 

Met Ser Phe Val Pro Ala Ser Phe Val Val Phe Leu lie Gin Glu Arg 
1665 1670 1675 1680 

Val Ser Lys Ala Lys Hi a Leu Gin Phe He Ser Gly Val Lys Pro Val 

1685 1690 1695 

He Tyr Trp Leu Ser Asn Phe Val Trp Asp Met Cys Asn Tyr Val Val 

1700 1705 1710 

Pro Ala Thr Leu Val He He He Phe He Cys Phe Gin Gin Lys Ser 

1715 1720 1725 

Tyr Val Ser Ser Thr Asn Leu Pro Val Leu Ala Leu Leu Leu Leu Leu 

1730 1735 1740 

Tyr Gly Trp Ser He Thr Pro Leu Met Tyr Pro Ala Ser Phe Val Phe 
1745 1750 1755 1760 

Lys He Pro Ser Thr Ala Tyr Val Val Leu Thr Ser Val Asn Leu Phe 

1765 1770 1775 

He Gly He Asn Gly Ser Val Ala Thr Phe Val Leu Glu Leu Phe Thr 

1780 1785 1790 

Asp Asn Lys Leu Asn Asn He Asn Asp He Leu Lys Ser Val Phe Leu 

1795 1800 1805 

He Phe pro His Phe cys Leu Gly Arg Gly Leu He Aep Met Val Lys 

1810 1815 1820 

Asn Gin Ala Met Ala Asp Ala Leu Glu Arg Phe Gly Glu Asn Arg Phe 
1825 1830 1835 1840 

Val Ser Pro Leu Ser Trp Asp Leu Val Gly Arg Asn Leu Phe Ala Met 

1845 1850 1855 

Ala Val Glu Gly Val Val Phe Phe Leu He Thr Val Leu He Gin Tyr 

1860 1865 1870 

Arg Phe Phe lie Arg Pro Arg Pro Val Asn Ala Lys Leu Ser Pro Leu 

1875 1880 1S85 

Asn Asp Glu Asp Glu Asp Val Arg Arg Glu Arg Gin Arg He Leu Asp 

1890 1895 1900 

Gly Gly Gly Gin Asn Asp He Leu Glu He Lys Glu Leu Thr Lys He 
1905 1910 1915 1920 

Tyr Arg Arg Lys Arg Lys Pro Ala Val Asp Arg He Cys Val Gly He 
1925 1930 1935 
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Pro Pro Gly Glu Cys Phe Gly Leu Leu Gly Val Asn Gly Ala Gly Lys 

1940 1945 19S0 

Ser Ser Thr Phe Lys Met Leu Thr Gly Aap Thr Thr Val Thr Arg Gly 

1955 1960 1965 

Asp Ala Phe Leu Asn Lys Asn Ser lie Leu Ser Asn lie His Glu Val 

1970 1975 1980 

His Gin Asn Met Gly Tyr Cys Pro Gin Phe Asp Ala He Thr Glu Leu 
1985 1990 1995 2000 

Leu Thr Gly Arg Glu His Val Glu Phe Phe Ala Leu Leu Arg Gly Val 

2005 2010 2015 

Pro Glu Lys Glu Val Gly Lys Val Gly Glu Trp Ala lie Arg Lys Leu 

2020 2025 2030 

Gly Leu Val Lys Tyr Gly Glu Lys Tyr Ala Gly Asn Tyr Ser Gly Gly 

2035 2040 2045 

Asn Lys Arg Lys Leu Ser Thr Ala Met Ala Leu He Gly Gly Pro Pro 

2050 205S 2060 

Val val Phe Leu Asp Glu Pro Thr Thr Gly Met Asp Pro Lys Ala Arg 
2065 2070 2075 2080 

Arg Phe Leu Trp Asn Cys Ala Leu Ser Val Val Lya Glu Gly Arg Ser 

2085 2090 2095 

Val Val Leu Thr Ser His Ser Met Glu Glu Cys Glu Ala Leu Cys Thr 

2100 2105 2110 

Arg Met Ala He Met val Asn Gly Arg Phe Arg Cys Leu Gly Ser Val 

2115 2120 2125 

Gin His Leu Lys Asn Arg Phe Gly Asp Gly Tyr Thr He Val Val Arg 

2130 2135 2140 

He Ala Gly Ser Asn Pro Asp Leu Lys Pro Val Gin Asp Phe Phe Gly 
2145 2150 2155 2160 

Leu Ala Phe Pro Gly Ser Val Leu Lys Glu Lys His Arg Asn Met Leu 

2165 2170 2175 

Gin Tyr Gin Leu Pro Ser Ser Leu Ser Ser Leu Ala Arg He Phe Ser 

2180 2185 2190 

He Leu Ser Gin Ser Lys Lys Arg Leu His He Glu Asp Tyr Ser Val 

2195 2200 2205 

Ser Gin Thr Thr Leu Asp Gin Val Phe Val Asn Phe Ala Lys Asp Gin 

2210 2215 2220 

Ser Asp Asp Asp His Leu Lys Asp Leu Ser Leu His Lys Asn Gin Thr 
2225 2230 2235 2240 

Val Val Asp Val Ala Val Leu Thr Ser Phe Leu Gin Asp Glu Lys Val 

2245 2250 2255 

Lys Glu Ser Tyr Val 
2260 

<210> 2 . 

<211> 7864 

<212> DNA 

<213> Homo sapiens 

<400> 2 

gtccctgctg tgagctctgg ccgctgcctt ccagggctcc cgagccacac gctgggggtg 60 

ctggctgagg gaacatggct tgttggcctc agctgaggtt gctgctgtgg aagaacctca 12 0 

ctttcagaag aagacaaaca tgtcagctgt tactggaagt ggcctggcct ctatttatct 18 0 

tcctgatcct gatctctgtt cggctgagct acccacccta tgaacaacat gaatgccatt 24 0 

ttccaaataa agccatgccc tctgcaggaa cacttccttg ggttcagggg attatctgta 3 00 

atgccaacaa cccctgtttc cgttacccga ctcctgggga ggctcccgga gttgttggaa 3 60 
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actttaacaa atccattgtg gctcgcctgt tctcagatgc tcggaggctt cttttataca 42 0 

gccagaaaga caccagcatg aaggacatgc gcaaagttct gagaacatta cagcagatca 480 

agaaatccag ctcaaacttg aagcttcaag atttcctggt ggacaatgaa accttctctg 540 

ggttcctgta tcacaacctc tctctcccaa agtctactgt ggacaagatg ctgagggctg 600 

atgtcattct ccacaaggta tttttgcaag gcfcaccagtt acatttgaca agtctgtgca 660 

atggatcaaa atcagaagag atgattcaac ttggtgacca agaagtttct gagctttgtg 720 

gcctaccaag ggagaaactg gctgcagcag agcgagtact tcgttccaac atggacatcc 780 

tgaagccaat cctgagaaca ctaaactcta catctccctt cccgagcaag gagctggctg 84 0 

aagccacaaa aacattgctg catagtcttg ggactctggc ccaggagctg ttcagcatga 90 0 

gaagctggag tgacatgcga caggaggtga tgtttctgac caatgtgaac agctccagct 960 

cctccaccca aatctaccag gctgtgtctc gtattgtctg cgggcatccc gagggagggg 102 0 

ggctgaagat caagtctctc aactggtatg aggacaacaa ctacaaagcc ctctttggag 10 80 

gcaatggcac tgaggaagat gctgaaacct tctatgacaa ctctacaact ccttactgca 114 0 

atgatttgat gaagaatttg gagtctagtc ctctttcccg cattatctgg aaagctctga 12 00 

agccgctgct cgttgggaag atccfcgtata cacctgacac tccagccaca aggcaggtca 1260 

tggctgaggt gaacaagacc ttccaggaac tggctgtgtt ccatgatctg gaaggcatgt 1320 

gggaggaacfc cagccccaag atctggacct tcatggagaa cagccaagaa atggaccttg 13 80 

tccggatgct gttggacagc agggacaatg accacttttg ggaacagcag ttggatggct 1440 

tagattggac agcccaagac atcgtggcgt ttttggccaa gcacccagag gatgtccagt 15 0 0 

ccagtaatgg ttctgtgtac acctggagag aagctttcaa cgagactaac caggcaatcc 15 60 

ggaccatatc tcgcttcatg gagtgtgfcca acctgaacaa gctagaaccc atagcaacag 162 0 

aagtctggct catcaacaag tccatggagc tgctggatga gaggaagttc tgggctggta 1680 

ttgtgttcac tggaattact ccaggcagca ttgagctgcc ccatcatgtc aagtacaaga 174 0 

tccgaatgga catfegacaat gtggagagga caaataaaat caaggatggg tactgggacc 18 00 

ctggtcctcg agctgacccc tttgaggaca tgcggtacgt ctgggggggc ttcgcctact 1860 

tgcaggatgt ggtggagcag gcaatcatca gggtgctgac gggcaccgag aagaaaactg 1920 

gdgtgtctat atgcaacaga tgccctatcc ctgttacgtt gatgacatct ttctgcgggt 1980 

gatgagccgg tcaatgcccc tcttcatgac gctggcctgg atttactcag tggctgtgat 2 04 0 

catcaagggc atcgtgtatg agaaggaggc acggctgaaa gagaccatgc ggatcatggg 2100 

cctggacaac agcatectct ggtttagctg gtfccattagt agcctcattc ctcttcttgt 2160 

gagcgctggc ctgctagtgg tcatcctgaa gttaggaaac ctgctgccct acagtgatcc 2220 

cagcgtggtg tttgtcttcc tgtccgtgtt tgctgtggtg acaatcctgc agtgcttcct 22 80 

gattagcaca ctcttctcca gagccaacct ggcagcagcc tgtgggggca tcatctactt 2340 

cacgctgtac ctgccctacg tcctgtgtgt ggcatggcag gactacgtgg gcttcacact 2400 

caagatcttc gctagcctgc tgtctcct-gt ggcttttggg tttggctgtg agtactttgc 24 6 0 

cctttttgag gagcagggca ttggagtgca gtgggacaac ctgtttgaga gtcctgtgga 252 0 

ggaagatggc ttcaatctca ccacttcggt ctccatgatg ctgtttgaca ccttcctcta 2580 

tggggtgatg acctggtaca ttgaggctgt ctttccaggc cagtacggaa ttcccaggcc 264 0 

ctggtatttt ccttgcacca agtcctactg gtttggcgag gaaagtgatg agaagagcca 2700 

ccctggttcc aaccagaaga gaatatcaga aatctgcatg gaggaggaac ccacccactt 2760 

gaagctgggc gtgtccattc agaacctggt aaaagtctac cgagatggga tgaaggtggc 2 82 0 

tgtcgatggc ctggcactga atttttatga gggccagatc acctccttcc tgggccacaa 2880 

tggagcgggg aagacgacca ccatgtcaat cctgaccggg ttgttccccc cgacctcggg 2940 

caccgcctac atcctgggaa aagacattcg ctctgagatg agcaccatcc ggcagaacct 3000 

gggggtctgt ccccagcata acgtgctgtt tgacatgctg actgtcgaag aacacatctg 3060 

gttctatgcc cgcttgaaag ggctctctga gaagcacgtg aaggcggaga tggagcagat 3120 

ggccctggat gttggtttgc catcaagcaa gctgaaaagc aaaacaagcc agctgtcagg 3180 

tggaatgcag agaaagctat ctgtggactt ggcctttgtc gggggatcta aggttgtcat 3240 

tctggatgaa cccacagctg gtgtggaccc ttactcccgc aggggaatat gggagctgct 3300 

gctgaaatac cgacaaggcc gcaccattat tctctctaca caccacatgg atgaagcgga 3360 

cgtcctgggg gacaggattg ccatcatctc ccatgggaag ctgtgctgtg tgggctcctc 342 0 

cctgtttctg aagaaccagc tgggaacagg ctactacctg accttggtca agaaagatgt 34 80 

ggaatcctcc ctcagttcct gcagaaacag tagtagcact gtgtcatacc tgaaaaagga 354 0 

ggacagtgtt tctcagagca gttctgatgc tggcctgggc agcgaccatg agagtgacac 3 600 

gctgaccatc gatgtctctg ctatctccaa cctcatcagg aagcatgtgt ctgaagcccg 3 660 
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gctggtggaa gacatagggc atgagctgac ctatgtgctg ccatatgaag ctgctaagga 3 720 

gggagccttt gtggaactct ttcatgagat tgatgaccgg ctctcagacc tgggcatttc 3780 

tagttatgga atctcagaga cgaccctgga agaaatattc ctcaaggtgg ccgaagagag 3 84 0 

tggggtggat gctgagacct cagatggtac cttgccagca agacgaaaca ggcgggcctt 3 900 

cggggacaag cagagctgtc ttcgcccgtt cactgaagat gatgctgctg atccaaatga 3 960 

ttgctgacat agacccagaa tccagagaga cagacttgct cagtgggatg gatggcaaag 4 0 20 

ggtcctacca ggtgaaaggc tggaaactta cacagcaaca gtttgtggcc cttttgtgga 4 0 80 

agagactgct aattgccaga cggagtcgga aaggattttt tgctcagatt gtcttgccag 4140 

ctgtgtttgt ctgcattgcc cttgtgttca gcctgatcgt gccacccttt ggcaagtacc 4200 

ccagcctgga acttcagccc tggatgtaca acgaacagta cacatttgtc agcaatgatg 4 2 60 

ctcctgagga cacgggaacc ctggaactct taaacgccct caccaaagac cctggcttcg 432 0 

ggacccgctg tatggaagga aacccaatcc cagacacgcc ctgccaggca ggggaggaag 43 80 

agtggaccac tgccccagtt ccccagacca tcatggacct cttccagaat gggaactgga 4440 

caatgcagaa cccttcacct gcatgccagt gtagcagcga caaaatcaag aagatgctgc 4 500 

ctgtgtgtcc cccaggggca ggggggctgc ctcctccaca aagaaaacaa aacactgcag 4560 

atatccttca ggacctgaca ggaagaaaca tttcggatta tctggtgaag acgtatgtgc 4 620 

agatcatagc caaaagctta aagaacaaga tctgggtgaa tgagtttagg tatggcggct 4 680 

tttccctggg fcgtcagtaat actcaagcac ttcctccgag tcaagaagtt aatgatgcca 4 7 40 

tcaaacaaat gaagaaacac ctaaagctgg ccaaggacag ttctgcagat cgatttctca 4 800 

acagcttggg aagatttatg acaggactgg acaccagaaa taatgtcaag gtgtggttca 4 860 

ataacaaggg ctggcatgca atcagctctt tcctgaatgt catcaacaat gccattctcc 4 32 0 

gggccaacct gcaaaaggga gagaacccta gccattatgg aattactgct ttcaatcatc 4 980 

ccctgaatct caccaagcag cagctctcag aggtggctct gatgaccaca tcagfcggatg 5 040 

tccttgtgtc catctgtgtc atctttgcaa tgtccttcgt cccagccagc tttgtcgtat 5100 

tcctgatcca ggagcgggtc agcaaagcaa aacacctgca gttcatcagt ggagtgaagc 5160 

Gtgtcatcta ctggctctct aattttgtct gggatatgtg caattacgtt gtccctgcca 5220 

cactggtcat tatcatcttc atctgcttcc agcagaagtc ctatgtgtcc tccaccaatc 5280 

tgcctgtgct agcccttcta cttttgctgt atgggtggtc aatcacacct ctcatgtacc 5340 

cagcctcctt tgtgttcaag atccccagca cagcctatgt ggtgctcacc agcgtgaacc 5400 

tcttcattgg cattaatggc agcgtggcca cctttgtgct ggagctgttc accgacaata 5460 

agctgaataa tatcaatgat atcctgaagt ccgtgttctt gatcttccca catttttgcc 5520 

t^Sggacgagg gctcatcgac atggtgaaaa accaggcaat ggctgatgcc ctggaaaggt 5 580 

t'tggggagaa tcgctttgtg tcaccattat cttgggactt ggtgggacga aacctcttcg 5640 

ccatggccgt ggaaggggtg gtgttcttcc tcattactgt tctgatccag tacagattct 5700 

tcatcaggcc cagacctgta aatgcaaagc tatctcctct gaatgatgaa gatgaagatg 5760 

tgaggcggga aagacagaga attcttgatg gtggaggcca gaatgacatc ttagaaatca 5 820 

aggagttgac gaagatatat agaaggaagc ggaagcctgc tgttgacagg atttgcgtgg 5880 

gcattcctcc tggtgagtgc tttgggctcc tgggagttaa tggggctgga aaatcatcaa 5940 

ctttcaagat gttaacagga gataccactg ttaccagagg agatgctttc cttaacaaaa 6000 

ataggtatct tatcaaacat ccatgaagta catcagaaca tgggctactg ccctcagttt 606 0 

gatgccatca cagagctgtt gactgggaga gaacacgtgg agttctttgc ccttttgaga 6120 

ggagtcccag agaaagaagt tggcaaggtt ggtgagtggg cgattcggaa actgggcctc 6180 

gtgaagtatg gagaaaaata tgctggtaac tatagtggag gcaacaaacg caagctctct 6240 

acagccatgg ctttgatcgg cgggcctcct gtggtgtttc tggatgaacc caccacaggc 63 00 

atggatccca aagcccggcg gttcttgtgg aattgtgccc taagtgttgt caaggagggg 6360 

agatcagtag tgcttacatc tcatagtatg gaagaatgtg aagctctttg cactaggatg 6420 

gcaatcatgg tcaatggaag gttcaggtgc cttggcagtg tccagcatct aaaaaatagg 64 80 

tttggagatg gttatacaat agttgtacga atagcagggt ccaacccgga cctgaagcct 6540 

gtccaggatt tctttggact tgcatttcct ggaagtgttc taaaagagaa acaccggaac 6600 

atgctacaat accagcttcc atcttcatta tcttctctgg ccaggatatt cagcatcctc 6660 

tcccagagca aaaagcgact ccacatagaa gactactctg tttctcagac aacacttgac 6720 

caagtatttg tgaactttgc caaggaccaa agtgatgatg accacttaaa agacctctca 6780 

ttacacaaaa accagacagt agtggacgtt gcagttctca catcttttct acaggatgag 684 0 

aaagtgaaag aaagctatgt atgaagaatc ctgttcatac ggggtggctg aaagtaaaga 6900 

ggaactagac tttcctttgc, accatgtgaa gtgttgtgga gaaaagagcc agaagttgat 6960 
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gtgggaagaa gtaaactgga 
atgaaaacaa aattccatta 
caagtgaaag acttgaattt 
ccaatggaca tatgggtttg 
cattggggtt gcaacaataa 
aaaaggtaat gcacatcctc 
gtgacacatc cattgctggc 
agtttgaagc accatggtgt 
atcaacattg aatatcagtt 
tccctctgat aagctgttct 
cacgggaaac ttggttccat 
tcatccttat gagactctta 
caaactgctg gggctgcaac 
aaacctaggg aagcctgtgc 
tcaagatgtt tatctgacac 
gaaa 



tactgtactg atactattca atgcaatgca attcaatgca 
caggggcagt gcctttgtag cctatgtctt gtatggctct 
agttttttac ctatacctat gtgaaactct attatggaac 
aactcacact tttttttttt tttttgttcc tgtgtattct 
ttcatcaagt aatcatggcc agcgattatt gatcaaaatc 
attcactaag ccatgccatg cccaggagac tggtttcccg 
aatgagtgtg ccagagttat tagtgccaag tttttcagaa 
gtcatgctca cttttgtgaa agctgctctg ctcagagtct 
gacagaatgg tgccatgcgt ggctaacafcc ctgctttgat 
ggtggcagta acatgcaaca aaaatgtggg tgtctccagg 
tgttatattg tcctatgctt cgagccatgg gtctacaggg 
aatatiactta gatcctggta agaggcaaag aatcaacagc 
tgctgaagcc agggcatggg attaaagaga ttgtgcgttc 
ccatttgtcc tgactgtctg ctaacatggt acactgcatc 
aagtgtafcta tttctggctt tttgaattaa tctagaaaat 



7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7864 



<210> 3 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gcagagggca tggctttatt tg 22 

<2ia> 4 

<211> 24 

<212> DKA 

<213> Homo sapiens 



<210> 5 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gaaagtgact cacttgtgga gga 23 

<210> 6 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 4 

ctgccaggca ggggaggaag 



agtg 



24 



<400> 6 

aaaggggctt ggtaagggta 



20 



<210> 7 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 7 

catgcacatg cacacacata 



20 



<210> 8 

<2X1> 27 

<212> DNA 

<213> Homo sapiens 



<400> 8 

ctttctgcgg gtgatgagcc ggtcaat 



27 



<210> 9 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 9 

ccttagcccg tgttgagcta 



20 



<210> 10 

<211> 26 

<212> DNTA 

<213> Homo sapiens 



<400> 10 

cctgtaaatg caaagctatc tcctct 



26 



<210> 11 

<;211> 26 

<212> DNA 

< 2 1 3 > Homo s ap i ens 

<400> 11 

cgtcaactcc ttgatttcta agatgt 

<210> 12 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 12 

gggttcccag ggttcagtat 

<2io> 13 
<211> 21 
<212> DNA 

<213> Homo sapiens 



20 



<400> 13 

gatcaggaat tcaagcacca a 



21 



<210> 14 

<211> 10545 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc__feature 
<222> (1) . . . (10545) 
<223> n = a, c, or g 

<400> 14 

acctcttata gaatgataga attcctctgg aatgattgga taacttcatt tcatccttga SO 

cttttacctt ggaggatttc ttaccccttt tggcttcfcca' aatttgacta ttaaaatgtt 12 0 

gcctttaaaa ataggaacac agtttcaggg gggagtacca gcccatgacc cttctgcaag ISO 

gcccGctaac tcaaggtagt ttccctggaa ctgtggttta tggaatgttt caggagtgtg 240 

aggaggtata atttaaggct gtcctagcaa ggataccctt aaggatagag ggcccagtag 300 

catctggagg ccagaaaagt taaactgagg cagtcagatt agcttcaggc tcaattaagc 3 60 

tgatgggtca gcctgggaga aattgcagga tgactctcaa tatcccctcc cacccccaca 42 0 

gcagccacga tctgtctgtc tttaatcatg ggtgcagtga acctgttctt tccaggtgtc 480 

ttggcctt-ca gtaaccttgt taggcttgtc cctgaacgtg gctaccgatc caaagacaca 540 

tgatcagaga ggcaattaga gaacagacct tttccaaagc aagcatgttc tgttgggctt 600 

agaagtttca tgtcctaata ttataggacc ctgtgcatct ctctggagat gaggcacatg 660 

agtcatatct gtgattcttg cttttgtgtc aacatctcat gaataggcaa tcagagcttt 720 

ggcaccaatg tattttcagt tcatatctga tgtagttaaa tccacctcct gcfcttgtagt 780 

ttactggcaa gctgtttttg atataagaca tctagaacac tgtaaatata taacattttt S40 

atttgtctat tatacctcaa ttacgaaaaa gacatctaga agcaacctca tcaagagaga 900 

tactgaggcc gggcatggta gctcacactt gcaatcccat tactttggga ggctgaggca 960 

ggtagatcac ttgaggtcaa gagtttgaaa ccagcctggc caacatgttg aaaccctgtc 1020 

tctattaaaa atacaaaaaa gttagctggg cttggtggtg ggcacctgta atcccagcta 1080 

ctccggaggc tgaggcagga gaatcacttg aacctgggag gcagaggttg cagtgagctg 114 0 

agatcacacc actgcactcc aacctgggca ccagagtgag attacatcta aaaaataaaa 1200 

taaagtaata aaaaagagag. atattgatag ctgttgttgg aaatttcaac ttccatctca 1260 

cttctggtaa ctttttggaa gtttgttgaa caaagtggaa tacacgcaca fcacacacaca 1320 

cacatactct cttgtttgtt taaggtttaa tgaaatagct gtcatataat cactgttttt 13 BO 

gaaagaggag aattagttgc tatctgtaca ttttgggtat gtgaactatt tggatagaac 144 0 

tctgagaaat gcattcagaa caacaaacaa aatcatagga gaaatagcta agtgggaagg 1500 

ggcatataag agttgttgaa aaagttattt cttgagaaac cagctctaat gctaggcaag 1560 

tcacttgctt tgggggaggc ctcagcttct ctgtctataa gattgcagca ggggtgtagt 1620 

gggaatgagt cttcaacatt ccaagagatt ttatctacta atacgacagt caaatggagc 1680 

atgactttgt ggaagcctct cctcttccac ccagaggggc caatttctct gtcccagtga 1740 

gatgttgaca cttgtatgat ccctgcttgg agacttccct cttctggaac ctgcGctggc 1800 

tcaggcatga gggctgactg tcacccttcg ataggagccc agcactaaag ctcatgtgtt 1860 

ggcagtgttc ttgcgggaag gaaaaagacc agccagccca tttgttactg cacaagcaaa 1920 

cagcttctgg tagctgtaca gatacatgca ctttctttcc tcactgtgtt tccatagaca 1980 

gatttagtgc tgtagaagag tagagggcag tcacgggaag gagttcctgt ttttcttttg 2 040 

gctatgccaa atggggaaaa atcctcctat cttgtctttt tagtgtcatc ctctctcccc 210 0 

ttttcttctfc ctttataatt ctcatctctc atctctcctg gaaatgtgca tgtcaagttc 2160 

aaaagggcac aatgttttgg tgaggaagag gtgggagaac acgtgccagg tgctaactag 222 0 

ggtcatcatt tcccccttca cagccagctt cctgtgaatg tgtgtgtgtg tgtgtgtgtg 2280 

tgtgtgtgtg tgtgtgtgtg tgtgtatttc ttttgccagc atcactgaat ctgtctgctg 2340 

tctggtattc caggttttgg tttagggaaa agtaaaagta attttataat cccagctgtc 2400 

atttaagcca cccctttgtg ggtagcatat ggtccactct ctcagttcat tgtcctaaag 2460 

atgcttcatc agaaaggaat aacttccacc ccgttactct ctgtcccctt actctgcttt 2520 

atttttcttc gtcaatccta ccaccaccac ccactgtttg aacaacccac tattatttgt 2580 

ctgtttccca tccctggtag aataggagcc ccatgaatga aggaactttg cttctgttgt 2640 

tcaccactga atctctaagg tatggaacac acctggcatg tgataggcac tcgataaata 2700 

tttgttgtgg ctcatgggca ccttgcagag ttaaggctgc agttgtttgt ggaatttata 2760 

agtggtaatg aatatttatc tactattcct cttccaaggc gatcacacaa taatcaggct 2820 

ttacactatc cagttcttag gtcttccaag ttatgacttg tgaggtatgt taattatgat 2880 

aatagaaggc agtttatttg gttcagattt attgatgtgt aatttaccac agtaagactt 2 94 0 
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cccctttaca aaagtatgat gagttttgac aaatggatac acatgtgtat ctaccactgc 3 000 

catgctcctt ttcagtctgt cgccccctcc acccatgacc actggtcacc actgcagtga 3060 

tttctgtccc cttcatttca ccttttccag aatgtcatat aaatggaatc atgcagtatg 3120 

tagctttttg tgtctggctt atttttctta gcattaggct tttgggattc atccaggttg 3180 

tcgcatgtaa cagtagctta ttccttttta tggctgagta agtgtcccag ttttatttat 324 0 

atatttattt atgaggaggt gtctcactct gtcacccagg ctggagtgcg gtagcgcgat 3 3 00 

ctcagctcac tgcaacctcc gcctcccagg tteaagcaat tctcctgcct cctgagtagc 33 60 

tgggattaca ggcacccacc gccacgccca actaattttt atatttttag tagagatggg 342 0 

gtttcaccat gttggccagg ctgatctcaa actcttgacc tcaggtgatc cgcccacctc 3480 

tggctcccaa agtgctagga ttacaggcat gagccactgt gcccagcccc agttttattt 3 540 

attcaccagt tgatggtctt ttcgacaact aattgtttcc agtttttggc tattctgtat 3 600 

aaggcttcta taaatattca caaataccta ggatgggatg actgggtcat ataatagtac 3 660 

tgtataacct tagcagaaac tgtcaaacta ttttccaaag tggctcttcc attttacaat 3720 

tccacagtgt attgagtccc agtgtctcca tacacatgct agcactttta atatttaatt 3780 

tagtgggtat gtaatgatat ctcattgtgg ttttaatttg catttctctg cagctaatga 3 84 0 

tgagtgtttc tgcttatttg ggaaggtttt aatttagcag tctgttgtat tctgtagata 3900 

ttaataactt caaaatatca gtggcatttg cagttaaaat ttccttaaaa aattggccaa 3 960 

aggtttccag cagtcacttc tgccatgccc aaactgtatg aaacaaggct gaggtgtgga 4020 

gattgtcaca ttttggcaag gagtgatcca cttgggtgac tgatgagacc cagagagcgt 408 0 

acgcctcggg cttgagggtg aggacgggcg ggaagtcgac tgcatggccc tgctggcctt 414 0 

gggaggctgc ccagtcctta gctaaagctg gcagttatgg gaaacagact tagattctat 4200 

tacgtttttc aggatgtccc aggagtcacc tgggaagctc agcagtcctt tgtgactttc 4260 

aagcatatgg tagaagctgc tgaacacaga gctccctctt tggggataat ttgcccaaat 432 0 

catttaatca ggcttgagaa atgagttacc acaggtccag gagtgctgcc acccttgaat 43 80 

tctgacaccc tatttctcct atccgtctct taattaatta agcagacatc cccaagtgct 4440 

tacgacaagc caggaccctt ttgcatacta aggaaaacag ggatgaagga aacagaaatg 4500 

gtctctgctc tgactcagaa ggtagaaatc ctctttccca gccaagtctt cctagggagc 4560 

acgtaggaag ggctctgaac ccacgtgtca gttgcagggg aggatatcag gaaaggacat 4620 

tgaagaagtg gagacctaag tttgagacct aggcattagc caggctagca gtgcttgaaa 468 0 

aagtgtctta ggacaagaga actcaccagt gaagtcccag tggtaggaga gcgtgcagca 4 740 

tattctgagc ctgtatacac atctccaggg cattgcttag caggtgggga gtggcaagag 4800 

agtaggctgg agtcacagaa gggaggccag gtagaccttg gtgagcactg gactctatgt 4 860 

tcaggtgctg aggagctggc aaaaggtttt aagtcgggga gaggcatgtt cagatatttg 4920 

gtctagctga gtaactttgg gtgctctgtg acaaatggtt gggagaccag tgaggtggca 498 0 

gttgcggtca tctaggagca ggatciagagt ggcctattga ctgggatgac tgtgaagtgg 5040 

gatcctttcc agccagtaac tggaaatgtg tatgagggca gaagtgagtg tactgcattt 3100 

gaaacattga gaaatctagt acatagtact gtctctttta tatctttttt tttttttttt 5160 

ttgattttgg tttgtttgtt c!actaacttg gaaaactgat gtggaaatgt ccctttggct 5220 

tcagttacct gagcagaagg ggccgggcat tgccaaactc tcctcttagg acagaattgc 52 80 

tcccagtatt gatcattgtg ttctgagttg ggggagcaaa ttgtgcagga ggccaggtca S340 

gtgccaaggt gggtgggagg aattggagca ggaagcttgc ctaagtgtgc ccagcaaagc 5400 

cacggtagaa ctttctactg fcggctctatg ctacttctta gcaaccttct ccatgtgctt 5460 

cctggagagt ccttggagtc agaacctttt tcttgaaacc cagacacttt acttccaaga 5520 

aaatgctgtc caagaaaact catdcttccc ttcttctcat gaacgttgtg tagaggtgtg 55 80 

tcttctcttc ctttgagctt ttccactcag ggtttagggg aggtgatatt ctatatttgg 564 0 

gtttggctct gggtactgca acactaggct attaagattt catccttact gctttgcccc 5700 

tcctatcttt ccagaaaccc acaatggatt tgctagaaafc aatggaacgt cctgtttgga 5760 

caggatataa ccatttctca gctagaggat attgttggaa tgaagaaaga taaatgggga 582 0 

gaagggaact cacattgcfct tggcacttaa attaagccat gtactgtgtt gggaaattat 5880 

ttatattatc tcgttgaatc cacagtagaa cacagttgaa caccatacaa ggtaagtatt 5940 

gtcatcctta ttttaccatg aggaaattga tgcttagaga. gcataaagcc ttggccaggg 6000 

gcacatagtt gggaagccgg ggctaattca tgcctgggct Gtttctgata gttttccttt 6060 

tttaattgtc ccctcctcat tgtfcaccttg gggatttcaa gagattcatg tagcttctaa 6120 

atcaacgaac tgattCGtgg agagcagctt ctgtatgaga aaaatctagc taattattta 6180 

tttcagtgtc tctggaatgc aagctctgtc Gtgagccact tagaaaacaa tttgggatga 6240 
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caagcatgtg tctcacaatg ctgctctggt tgccagtgct gtgctgccag ttgtcatctt 63 00 

tgaacaaact gatgcagtgc tggtttaact cttcctcttt ttggagtaag aaactttgga 6360 

ggcctgtgtc cttctagaag tttgctgagc aaatggtaag gaaaagaaat aggtcctaag 6420 

gcttgactat ttcagagaat ttcttgattt attggactgt caatgaatga attggaatac 64 80 

atagtggtag gctgtctttt cttctcagac actgcaattt cctccaatct cttgactttt 654 0 

ctagaagttt taatccaagt ccttgttggg tggtagataa aagggtattg ttctactaga 6600 

gactgacctt ggcatggaga tctcatttgg actcacagat ttctagtcta gcgcttggtt 666 0 

ttgtatccat acctcgctac tgcattctta gttccttctg ctccttgttc ctcatgccca 672 0 

gtgtcGcacc ctacccttgc ccctactcct ctagaggcca cagtgattca ctgagccatt 67 80 

tcataagcac agctaggaga gttcatggct accaagtgcc agcagggccg aabtttcacc 684 0 

tgtgtgtcct cccttccatt tttcatcttc tgccccctcc ccagctttaa ctttaatata 6000 

actacttggg actattccag cattaaataa gggtaactgc tggatgggtg gctgggatac 6960 

acagaatgta gtatcccttg ttcacgagaa gaccttcttg ccctagcatg gcaaacagtc 7 02 0 

ctccaaggag gcacctgtga cacccaacgg agtagggggg cggtgtgttc aggtgcaggt 70 80 

ggaacaaggc cagaagtgtg catatgtgct gaccatggga gcttgtttgt cggtttcaca 714 0 

gttgatgccc tgagcctgcc atagcagact tgttt.ctcca tgggatgctg ttttctttcc 7200 

agagacacag cgctagggtt gtcctcatta cctgagagcc aggtgtcggt agcattttct 7260 

tggtgtttac tcacactcat ctaaggcacg ttgtggtttt ccagattagg aaactgcttt 7320 

attgatggtg cttttttttt ttttttttga gacagagtct cgctctgtcg ccatgctgga 73 80 

gtgtagtggc acaatcttgg ctcactgcac ctccgcctgc caggttcagc gattctcctg 744 0 

cctcagcctc ccaagtagct gggactacag gtgcctgcca ccatgcccag ctaatttttg 7500 

tatttttagt agagacgggg tttcaccgta ttggctagga tggtctcgat ttcttgacct 7 560 

cgtgatccgc ctgcctcggc ctcccaaagt gctgggatta taggcttgag ccaccacgcc 7 620 

tggccgatgg tgctttttat catttgaagg actcagttgt ataacccact gaaaattagt 7680 

atgtaaggaa gttcagggaa tagtataagt cactccaggc ttgaggcaaa atttacaaat 7740 

gctgctgact ttgtatgtaa ggggaggcat tttcttagaa aagagaggta ggtctqtggg 7800 

attccagtat gccatttcca tcctcagtgt ttttggccac ctgagagagg tctattttca 7 8 60 

gaaatgcatt cttcattccc agatgataac atctatagaa ctaaaatgat taggaccata 7920 

acacgtagct cctagcctgc tgtcggaaca cctcccgagt ccctctttgt gggtgaaccc 7980 

agaggctggg agctggtgac tcatgatcca ttgagaagca gtcatgatgc agagctgtgt 8 040 

gttggaggtc tcagctgaga gggctggatt agcagtcctc attggtgtat ggctttgcag 8100 

. caataactga tggctgtttc ccctcctgct: ttatctttca gttaatgacc agccacggcg 81.60 

tccctgctgt gagctctggc cgctgccttc cagggctccc gagccacacg ctgggggtgc 822 0 

tggctgaggg aacatggctt gttggcctca gctgaggttg ctgctgtgga agaacctcac 8280 

tttcagaaga agacaaacag taagcttggg tttttcagca gcggggggtt ctctcatttt 8340 

ttctttgtgg ttttgagttg gggattggag gagggaggga gggaaggaag ctgtgttggt 84 00 

tttcacacag ggattgatgg aatctggctc ttatggacac agaactgtgt ggtccggata 84 6 0 

tggcatgtgg, cttatcatag agggcagatt tgcagccagg tagaaatagt agctttggtt 852 0 

tgtgctacitg cccaggcatg agttctgatc cctaggacct ggctccgaat cgcccctgag 85 80 

caccccactt tttccttttg ctgcagccct gggaccacct ggctctccaa aagcccctaa 8 64 0 

tgggcccctg tatttctgga agctgtgggt gaagtgagtt agtggcccca ctcttagaga 870 0 

tcaatactgg gtatcttggt gtcaatctgg attctttcct tcaggcctgg aggaatataa 8 760 

taactgagac ttgttttatt tctgcagagg gttctaagcc attcacttcc cagatgggcc 8820 

aataatgctt tgagtaatct ggagatcatc tttaatgcgc aggtgaatgg aactcttcca 8860 

c^gagggatg tgagggctgt agagcagagt gaactccctg aaactcagac gtcagctctt 8940 

tgtctctcta tctctgaaca cccttcctta gagatcccat ctctaggatg catttctctg 9000 

tagttagttt ctaagtctct tgttcctgtt ctgcctttat ttttttttcc tggattctaa 9060 

gccagtatcc ccacttggct gtcttaatgt agcttaacat gtctgtaatc aaaatgatca 912 0 

tctttctgag attcaaaggg ctataaggga ctttggagag aatttcattc agttttcctc 9180 

aaactagaat aatgcttgca ctgtctgtaa aagaacaaaa gtgtcaaagc atccttttgt 924 0 

tcactaaatt tcctttttta ttatagtgtt acttaaatat taggaagtta aaagtaggta 9300 

taaacttctt ataggctgtt attatacaac tatatgaccc atacatattt acaaattaag 9360 

tgcagccaaa attgcaaaat caataccatt caaattaata ccttaaatgt ggtgaggcag 942 0 

ctgttgttca actgaaacca aattataagt tgcatggcag taaatgctat catgctgatc 948 0 

attttgagtt tggccagtct atattatcat gcgctaatga ttgaattctc cacccatttt 9540 
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tctacttgta tgaccttaat ttgatggcac ctgttccatc ctcatgagtt tgctacaatt 9600 

atactggtgc caacacaatc ataaacacaa atataaactt gggctttgaa atcttgtgcc 9660 

agaacttggc tttaaagtaa gcatttaaaa aatccatatg tgtttattag actttgttta 9 72 0 

gatgactgtt gaaatgaaaa caaagtgttt aaaatcctct tagagaactt aaatataatc 978 0 

cctcagcaat atgtatacag atcttccttt gagaaaaact gattgtgttc agcctctcat 9840 

gttacaaatg gggaacctga attctgaggt ctctagtgag agaacaggga ctggaatctg 9900 

tggatcctat ctgttttaat aataattgta aagtataata gataatatta tattaaaaag 9960 

agagnnnnrm acacttagaa tgagcttcca tgtgtgaggc actaactgat taggcattat 10020 

taactagatt tattcctttt aaggccccgc gatgtactgt tatttccaca tgttgtagct 10080 

ggggaacgtg ctactcagag aggttaagta acttgtctga ggtccacacc actaacaagg 1014 0 

agcacaggta gggttcaaat ccagataatc tgactttgga gctggcactc taactcaatg 1020 0 

tgcctaatcg cttttcagtg gtgtcattat tttgcctatt ctccatctga gaatattgaa 10260 

gtttctgact ccttccttgc etttctccct gcctcccgtg gttatcccca ggtcttggtg 1032 0 

ttccagtcct ctatgtccgt ccttactctt attcctttgc tacagtgtga tccagggctc 103 80 

ctgcccttct tatcctggta gagggggccc acttgctggg aaattgtctc cgccatggtt 10440 

tatccatgtt gtgtgtccat tagtgagtag tgggaagaat catatcatgt tggcaatgaa 10500 

aggggggcta tggctctggg gtagtctagt ctgaactctt atttt 10545 



<210> 15 

<211> 4736 

<212> DNA 

<213> Homo sapiens 

<400> IB 

cttttttttt tttttttttt tttttttttfc tgaggtgaag tctcactctg ttgcccaggc 60 

tggagtgcaa tggagcgatc ttggctcacc ccaacctctg tctcctgggt tcaaacagtt 12 o 

ctcctgcctG agectcccga gtagctggga ttacaggctc ccgccaccat gcccagctat 180 

ttttttgtat tttcagtaga gatggggttt cacccttttg accaggctgg tcttgaactc 240 

ctgacctcat gatcaaccca cctcagcctc ccaaagtgct gggattacag gtgtgagcca 3 00 

ccacgcccgg cctcataagt attttctaaa tttatttaca gtcatgccat ttaaaaggaa 36 0 

agttgtattc ctgtctttgt taatatttat aagtgatttt attcagctac aagcttggaa 42 0 

tggcatataa ttttgtattc tgcttttttc acttaatatt acatggctaa tgatttctgt 480 

gtttcataaa cattattctg atgatggcat gatatattgt tgagtacatg taccataaut 54 0 

gaatcatttc cctatfcgcta tgcaattaag ttgtttccaa tattttgcaa ttataatgtt 60C 

tcaatgaatg aataacttta tgcatatagc tttttgatat cttaagttca gtttcctagg 66 0 

atgaatttcc aggaatagta attgggcaaa tgggataaac atgactcttg aatacgtatt 72 0 

gttaacattg ctttcccaaa gggctcaact: gatttatatt tccgtgttca ttatctttta 78 0 

aaccagctca tttactcacc aaacattttt aaagccatta tcatgtggta ggcttagtaa 84 0 

gaagaaagtg accctaaggg agaagcttat atataaatag ggtccctggt gtaccaagtg 90 0 

ctgatacaga cacaaagtac ctggggaaat tgagatgagg gagtcctggc tcagctggga 96 0 

gaaaagttca ttttcataga gtcatggttt tgttctttgg cagaaagaaa attgctttct 102 0 

tccccacccc cacccccagc tttattgagg tataattgac aaataaaaat tgtatatctt 1080 

taagatatgc aatgtgatat atatgtatat ctcaacttaa aaaataagct acagaataaa 114 0 

aaggtgtttg ctattaaaaa aaaagaaaag gctgaatgtc attcccaagc ttggaaattt 1200 

gagtatgttg cctctttggg attatttaca gaaatattag caagaccagc cccatctttg 1260 

gtcttgagta ctccactgtc agcatgcttt cttccagaga gggatccatt tgcctttatt 132 0 

t-ttcattctg ttgtgccgtc tatgcaaact attcttgata gttttafcggt aacagtgttt 13 8 0 

ttttgttcca tgagataaat ttatacatgc tcattgtgga aaatttagaa aagacaggaa 144 0 

agtattaaaa acatcmcytt tttttttttt tttttttttt tttttttamg cagacagagt 15 0 0 

cttgctctgt cgcccaggcc ggagtgcagt ggcgtgatct cagctcacag caacctccgc 1560 

ttcccaggtt taagtgattc tcctgcctca gcctcccaag tagctgggag tacaggcatg 1620 

caccaccacg cccggctaat tttgtatttt tagtagagat ggggtttcac catgttggcc 1600 

aggctggtct caaactcctg acctcaggtg atccgcctgc cttggcctcg caaagttctg 174 0 
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ggattatagg caggagccac tgcgccagcc acacctacgt tcttatcatc ctagtacatc IB 00 

cactgtcatt atcttgctgt atttccttct gcccagtctc actctgatca tgcagtggcg 1860 

tgatcatgca gtgatctcgg ctcactgcaa cctaggcctt ctgggttcga gtgattctcc 1920 

tgccttagcc tcctgggttc aagtgattct cttgccttgg cctcccaagt agctgggatt 1980 

acaggcatac acccccatg.c ccatctaatfc tttgtatttt tagtagacac agcgtttcac 2 040 

taaaattttg tatttttagt agagatgggg tttcaccatg ttggccaggc tggtctccaa 2100 

ctcctgacct caggtgatcc gcctgccttg gcctcacaaa gtgatfcacag gcatgagcca 2160 

ctgcatccat cgccaaaaag attttttaaa agagtttaat gtagaaccat atcaaaggtc 2220 

tttggaaata aaaaacagtt ttttaaaaat atcagaaata aaacaacaaa taaataaata 2280 

aataaaaaca cccaaaacaa tctgaagcac gagcacctag cagaaaggtt caattatgat 2340 

ctattcatag agtggaatat caagtagaca ttacaggaca tgttttaaga ttatatttta 24 00 

tgtcatggga aatgctctcc cagtatgatg ttaaatgaaa aaacagaata caaaagtata 24 60 

tatgctgcat agtctcaata ttgtagagaa aaaatattat ttatgtatgc atgaaaaaag 2520 

acaaaagatg ttaacagaga tccattgtta cttcagttta ctagggattg tctctgggag 258 0 

gtaggattaa ggtgatttat atttaccttt ttaaactttt ctgtattttt ttattttcaa 2640 

attttccata aaaatataag gacttgaaga tcaagaaaaa atttctgctt tggctcagtg 27 00 

cagtcgtcac gcctgtaatc ccagcagttt gggagcccta ggggagagga tcacttgaac 2760 

ccaagagttt gacgttccag tgagctatga tctccggatc gtaccgcctg gacgatggag 2 820 

caagaccctg tctcaaaaaa aaaaatcttt gctttttttt tttgtttgtt tttgagacgg 2 880 

agtctctctc tgttgcccca gctggagtac agtggcacaa tctcagctca ccgcaacctc 2 940 

tgcctcctgg gttcaagcga ttctcttgcc tcagcctccc aagtacctgg gattccatgc 3000 

acccaccact atgcccagct acttttttgt attttcagta gagacagggt ttcaccatgt 3060 

tggccaggct ggtctcgaat fccctgacctc agctgatcca ccggccttgg cctcccaaag 3120 

tgctgggatt acaggcatga gccactgtgc ccagcccaat: cttttgcttt ttttaaaaaa 3180 

agaagacaaa aagggatttt ataccagtat tatcttggct gtgtgactct gaagccacag 3240 

ttgtaagtta taafctactct gaaacacaag gccctgtgac tcttttgggc tctttggtgt 3300 

ttatcttgat tacaacgttg gaatatagaa atgaaaggaa tgggagaggt gatagacttc 3 360 

aggcagtgta actagttgtc tgaacactac tggctcaatt atattgtgtc tagtgatttc 3420 

catcttgtcc gtctgctaat ttatcgcctg gtaactcact gaggcagggt tttcctttgg 3480 

agaaacctca ttgttttaac cagtgtatca tgcttgttta gaagttcaat gatcttttta 3540 

actcatcgga gaagatgatg accagacctg gacagatggg gaaggacttt gcactctctc 3 600 

tttacagtcc tgagtgcaca caggtcaata tggaactatg tgtgaatttt cattgtcttt 3 660 

gagagccctc ttctctgccc catagggagc agctttgtgt gcaattagag gagcaagggt 3720 

fcgtgtgtatt tagcacagca ggttggcctg gtcctctcct ctcaacatag tcaccacata 3780 

cctggcacta tgctaaggct gggaatgcag acagatgggt gcctgctttc agagtgctca 384 0 

atgtigctgag gaagccagca acagaaacag atgatttcag gagctccagg aaaatgctac 3 900 

aggaggagtg tgcctgggtt actggagtag cacaggagga gggcttctag ctcaggctga 3960 

gattttagta aaggaaatta tgccacgatg aatcctgaag aatgaataga agtgaaccag 402 0 

ataaagcacg ataggaagca tcttccctta cctaagggaa gacacagagg tatatggaat 4 0SO 

ggtatgttaa aaggttggga ctccaaacag ttctgttaaa gcttagagag tggtgggaga 414 0 

gactggagaa gttgattaat: tagtaaatga agttgtctgt ggatttccca gatcccagtg 42 00 

gcattggata tccatattat ttttaaattt acagtgttct atcttatttc ccactcagtg 4260 

tcagctgctg ctggaagtgg cctggcctct atttatcttc ctgatcctga tctctgttcg 432 0 

gctgagctac ccaccctatg aacaacatga atgtaagtaa ctgtggatgt tgcctgagac 43 80 

tcaccaatgg cagggaaaat ccaggcaatt aacgtgggct aaattggact tttccaaaga 4440 

tgctgtcttt gggaaacatc acacatgctt tggatcagaa aacctaggct tctaatttgt 45 0 0 

tgataaggca tgaactcagg agactgtttt cagtcctagt gaatggtgat aattgtaatt 45 60 

ataacagtag acaacatctc ttttacacat tttaaatcat gaaaatagaa taaccttact 462 0 

gataatttta gaaagtggtg attaaaagca catttaagat aatgccttaa cacctagtct 4680 

tttccatatg catgatgtct taatcacaca ttgcaaatca tggaacacag aatttt 473 6 

<210> 16 

<211> 4768 

<212> DWA 

<213> Homo sapiens 
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<400> 16 

atcttacaat cacagtcttt ctcttagggc tgggctcagt gggtggattg acactgcaga 6o 

aatggccaga tctaaaggat caacatttac gtagctggga aatgtagctg ggacttcagt 12 0 

ttcactgccc tagtgatutt tcctaccact aagcagctca gtccataccc ctacgagacc 180 

cacaagctta tgagatactg ttcttccagg aaagcagtgg ggccagggcc accttttaat 24 0 

tgtgtttctt ggcctggtcc catctttctc acaatatafca gcaacagtta tttacttgct 300 

gattttctaa tgcacatcac acatagtcat attaaacaca cacacacaca cacacacaca 360 

cacacacccc tcaagaaaca ttttctgaga cgtgatttcc tgatttcatc aaaaaagaaa 42 0 

agagcgggcc aggcacagtg ggaagtcaag gtgggtggat cacttgaggt caggagtttg 480 

aaaccagccfc ggccaacacg gtggaacctc gtctctacta aaaatacaaa aattagccag 540 

gcgtggtggc gcacacctgt aatcccagct actggggagg ctgaggcagg agaattgctt 600 

caacGtgcga ggctgaggtt gcagtgagcc gagattgcgc cattgcactc cagcctgggc 66 0 

aacagagtga gactctgtct caaaaaaaaa aaaaaaaaaa aaagcataaa ctgaaattta 72 0 

tatgcaattt atatgcctgt gagataattc tgttttctct tttggaaccc caaagagatt 78 0 

tttttgattg atgagcaaat acattttaga ttttatttaa gcattatgcc aagcaccact 84 0 

gaagtataag tttcaagggc aaactcagtt ttttcatcta ctagacgaat gattttctgg 900 

aatgattaca agcaggcaag atggtgtagt ggaaatagca aatgtcttcg gcatcagaca 960 

agttggggtt tgtttgtatc ctgcctctgc ccttcaccga ggttgtga-tc ttgggcagat 102 0 

tgttgagttt taacctagat tcctctgact ccagatcata aattttcaga aaagttctga 1080 

aattcttgta tatactgatg gtaaatgaga ettttcctta catctatgca cttctttgtt 114 0 

tgtttgtttt gagatggtct tgctctgttg cccagactgg agtgcagtag tgcaatctcc 120 0 

gctcactaca atgtctgcct cccaggttcc agtgagcctc ctgcctcagc ctcccaaata 1260 

gctgagacta caggcatgtg ccaccacgtc cggctaattt ttgtattttt agtagagaca 132 0 

gggttttgcc atgttgacca cactggtctc gaactcctgg cctcaggtga ttcgcccgcc 13 8 0 

tcagcctccc aaagtgctgg gattacaggc atgagccacc atgcccggcc atatccatgc 144 0 

acttcttgca accttacctt cttttctcat caccctccag ggacctagtt ggaagagcag 1500 

agttaaaagt taaggtgaaa cttggagagg tgtcttgtcc ctaggaacaa aggactggtt 156 0 

tgaaattctc tgtaaatctt ccccagttca aaccagagtt atcaaggtct taaaaacttc 162 0 

cctgggtcct gagagcccat tatattattt acttgtcttc ctgtacaccc actgcctagt 1680 

cctgatccta cttttgtttg caaataggat ggggcacaac gtacaaggaa gggcctttgc 1740 

cacccctgct aagggataac ctgaaatacc ttcaccafcca ctgccctgtg ctgcttttca 18 00 

cctatgccag tctgtctaca gtgccagtgt ctcctggcat tgaaagggga gaafccttttg 1860 

gtcctttgag tatttggttg ggttacataa atctccctga atgaagagca gctgacttag 192 0 
gcaaggggcG ttgtttggtt ttccttgaac tatfcaacagg aagataggga gattaactgt ' 1980 

gtaaatgttc aataggccag agtccctgca gagggtggcc acagtgatca gatcttatca 2040 

catccttgct ttgggtgttg cctctctggt tggagtatgg atagaaaaga aagaaagacc 21O0 

ctatattgaa atgcaaagtg cagcaagtcc tgactttgga ttaacttctc agcccatttg 216 0 

catgaaaata aaaagatgaa taaaacaagg ttcccacttt ggagggaggt ggtagctgtg 2220 

agatggaagg agtgttcctg ctgggcaaca gcagagtaag tgctggggta gattcactcc 2280 

cacagtgcct ggaaaatcct cataggctca tttgttgagt ctttgtccta caccaggcac 234 0 

tctgcaaaaa cgctttgcct gcaaggtctc atgcgatgct caccacagct ctgtgaagtt 24 0 0 

aattgtactt ttatcaccafc tttacagatg agaaaactga gggtatgggg tcaatgactt 2460 

ggctaaagtc actgcttagc aagctgcagg gactggatgfc gaattccaat tggtttgact 2520 

ccaaagcctg tgaagctact tgttcttcac cacctagagc tgtggttctt gataactgtg 258 0 

aactcttttg gggtcacaaa tagccctgag aatatgatag aagcaggagc tctggccttt 2640 

ctgtccatac ctgaacaggt ccttgggtta agagcccctc gtccagggcc tattaatctt 2700 

gatcctcata agcagcatcc atgtattacg gccgcaaacc aaactgtgcc agaccgaatc 2760 

ctaggaccaa gcccaaatafc gtcccatcat ccttttggta agaagctcat tgtaagaaag 2820 

aaagaggaga gcaagaggat gacctagtgc atggggcctc attgttttaa ttagtgacaa 28 80 

aacaacaafca ataacaacaa aacccccgaa gcttcacaga tgacatcaga ccccaagcct 294 0 

gtgtgttttt caggtgccct tgaggagctt tgtagctggc agaggaggtg aaactgacaa 3 0 00 

atgtttggca gatggaggag agtaccagag gggtttgaga tgagctaaat tccaatcfcaa 3060 

ccgcagtgtt gaggaagagg cttggattgg gaccatggag atgggggttc tactcccagt 312 0 

cacgccagct gactttgcga gtgttctttg tcagtcactt tatcttattt tatttatttt 3180 

tatttttttg aaatggagtt tcgctcttgt cgcccaggct ggagtgaaat ggcgcgatct 3240 
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tggctcactg caacctcccc ctcctgagtt caagcgatfcc tcctgcctca gcctccagag 33 0 0 

tacctgggat tacaggcgcc tgccaccaag cccatcgaat ttttgtatgc ttagtagaga 33 60 

cagggtttcg ccatgttggc cagggtggtc ttgaactcct gacctcaggt gatccgccca 3420 

ccttggcctc ccaaagtgct gggattacag gcgcgagcca ctgtgcccag cccacttcat 34 80 

cttaccgtag ttacctcctt agagtatgaa aaaataggct tagggcatcc ccaagtcccc 354 0 

tctatgtctg agagctgagg ctggctgtca aagaggaact aaggatgcca gggactttct 3 60 0 

gcttaggacc cctctcafcca cttctccaac gctggtatca tgaaccccat tctacagatg 3 660 

atgtccacta gattaagaat ggcatgtgag gccaagtttc cacctgagag tcagttttat 3 72 0 

tcagaagaga caggtctctg ggatgtgggg aatgggacgg acagacttgg catgaagcat 378 0 

tgfcataaatg gagcctcaaa atcgcttcag ggaattaatg tttctccctg tgtttttcta 3 84 0 

ctcctcgatt tcaacaggcc attttccaaa taaagccatg ccctctgcag gaacacttcc 3900 

ttgggttcag gggattatct gtaatgccaa caacccctgt ttccgttacc cgactcctgg 3960 

ggaggctccc ggagttgttg gaaactttaa caaatccatg taagtatcag atcaggtttt 4 020 

ctttccaaac ttgtcagtta atccttttcc ttcctttctt gtcctctgga gaattttgaa 4080 

tggctggatt taagtgaagt tgtttttgta aatgcttgtg tgatagagtc tgcagaatga 4140 

gggaagggag aattttggag aafcttggggt atttggggta tccatcacct cgagtattta 42 0 0 

tcatttctgt atgttgtgaa catttcaagt cctgtctgct agctattttg gaatatacta 4260 

tatgttgtta atgatatcat gcagcagacg tgcatctgaa tgggctggct ctaggagcta 432 0 

gagggtaggg gctggcacaa agatgcatgc tggaagggtc cttgcccata agaagcttac 43 80 

agccaaggct aggggagttc tgtcttctct gcatcaggtc acctctctca cctctgtcac 444 0 

tgccccatca gactacaatg tctgcaggtc tttctcccct gagtgtgagc tccctgagca 4S00 

aagcaggatg ctgccccttc cctttgtatt ccttgctcct tgcttcagtg cctgtacata 4560 

agtacgggca taataagtgt cccccaaatg agacattgag gattcttcaa atgcacagga 4620 

ccgtgatgtg agttaggacg gagtaaggac gatgggatgt ggctcatgac aatcctgagg 4680 

aagctgcagc tgcggcacgc agggccacac tgtcatgttc atggacccta gactggcttt 4740 

gtagcctcca tgggcccctt ccatacac 4 768 

<210> 17 
<211> 1295 
<212> DNA 

^<213> Homo sapiens 
<400> 17 

tcatgactgc cattggtata aagatgaata taatccagac cagattcatg attattcata 60 

catttttagt gtattaactt ttaattctgc ttttaaaata aattaaaaca ttctaatatg 120 

cccttaagag tatcccagcc caggccactg agcctactgt ggttcatgga taagtttgcc 18 0 

Gctgggggca tgtgtgtgca tgcatgtgtg tgcacatgca tgatgagccg ggccttgaag 24 0 

ggtggtaaga tttgggtgtg tagaccaatg gagaaaggca tttggggcag tgatgatggg 3 00 

tgggggaggg. aacatggtga tgaatggagc tgggtgtggg gagccatggg agtgggttag 3 60 

ggccagcctg tggaggacct gggagccagg ctgagttcta tgcacttggc agtcacttct 420 

gtaaagcagc agaggcagtt ggcctagcta aagcctttcg ccttttcttg caccctttac 480 

agtg.tggctc gcctgttctc agatgctcgg aggcttcttt tatacagcca gaaagacacc 54 0 

agcatgaagg acatgcgcaa agttctgaga acattacagc agatcaagaa atccagctca 600 

agtaagtaaa aaccttctct gcatccgttt ataattggaa attgacctgc accagggaaa 660 

agagtagcGc aggtgtctgg ggcttgttcc cattagatct tccccaaggg gtttttctcc 720 

ttggtggcfcg gcctgtgggg cccctctcca ggaggcattg gtgaagaaac taggggagct 7 80 

ggttgccaca gacagtgatg tactaatctt ctctgggaag acagaagaaa agtccccagg 84 0 

gaagaat act acagacttgg ccttagggac agctaggggt gcagattgct gccaactgca 900 

ttttttctga agttggccat atggttgcag tgaatggatt tatagacaga gtatttctgt 960 

gcatataaga gcaattacag ttgtaagttg atatggataa gtgaaagtta agcacttctt 1020 

tctaaaaaga gaatgcaatt cattttcccc taatcatttc aattagtctg atgggcattt 1080 

gaacttgttg tctttaaaaa gtgaaatctt tacctctgat ctggtaagta tccaggcaat 114 0 

ttcttgtgtg ccacccagga ggtatctggg gagtgggcat tttctgactg aggcattggc 12 0 0 

tgccatagca tcagagcagc cttccaggca gtggcctggc aaggggacag aggctggtgg 12 60 

gagcagctgg ctgagtgcag ccagtaatgg catgt 12 95 
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<210> 18 

<211> 2188 

<212> DNA 

<213> Homo sapiens 

<40Q> 18 

agctctccag gtgattctga tgcatactta agtttgagaa ccattgcfctg ttttgcatta 60 

aacaggagat tagtctctgc agcttgtggg aataaagctt taaatctctc caattttagc 120 

tctgtgaaaa ggcagtgggg agacaggaat gaacggacta gtgccacaaa gctcaggtgg ISO 

ggtgggtgag atcatttaga agagaaagac cgggcatggt ggctcacgcc tgtactgtca 24 0 

gcactttggg aggccaaggc aggttggatc acaaggtcag gagtttgaga ccagcctgcc 3 00 

tatcatggtg aaaccctgtc tgtactaaag ataaaaaaaa aaaaatttgc cagtcatggt 3 60 

gatgcatacc tgtaatccca gctactcggg aggctgaggc aggagaatct cttgaacccg 420 

gga99cgggg gttgcagtga gctgagattc caccattgca ctccaaccta ggtgacaggg 480 

tgagactccg tctcaaaata aaaaaaaaaa aagaaaagga aaggctgtgt gtgtgtgtat 540 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtaa cagcaccatc acactgtttg agttgaggag 600 

cacatgctga gtgtggctca acatgttacc agaaagcaat attttcatgc ctctcctgat 66 0 

atggcgatgc tcccctatct cattcctgtg tgtgtttagc caggcaactg ttgatcatca 720 

atattatgat aacgtttctc cactgtccca ttgtgcccac tttttttttt trtttgagtt 780 

acttactaaa taaaaataaa acactatttc tcaatagact tgaagcttca agatttcctg 840 

gtggacaatg aaaccttctc tgggttcctg tatcacaacc tctctctccc aaagtctact 900 

gtggacaaga tgctgagggc tgatgtcatt ctccacaagg taagctgatg cctccagctt 960 

cctcagtagg gctgatggca attacgttgt gcagctactg gaaagaaatg aataaaccct 1020 

tgtccttgta atggtggtga aggggaggga ggtagtttga atacaacttc acttaatttt 1080 

acttccctat tcaggcagga attgccaaac canccaggag tggaatatgc aacctggcgt 1140 

catgggccag ctggttaaaa taaaattgat ttctggctta tcacttggca tttgtgatga 1200 

tttcctccta caagggatac attttaagtt gagttaaact taaaaaatat tcacagffcct 1260 

gaggcaataa ccgtggttaa gggttattga tctggaggag ctctgtctaa aaaattgagg X320 

acaggagact ttagacaagg gtgtatttgg agacttttaa gaattttata aaataagggc 13 80 

tggacgcagt ggcactgagt tgagaactgt tgcttgcttt gcattaaata ggagatcagt 144 0 

ccctgcagct tgtgggaata aggctttaaa tctctccaat tttagctctg tgagatggca 1500 

ctggggaaac agaaatgaac ggactagtgt cacaaagctc aggtgggatg gacgagatca 1560 

cttcaaaggt ctgtaatccc acgtctataa tcccagcact ttgggaggcc aaggcgggaa 1620 

aatcacttga ggtcaggagt tcgagaccat cctggccaac aatgcaaagc ctgtctctac 1680 

taaaaatatg aaaattagct cagcgtggtg gcatgctcct gtagtcccag ctactcgtga 1740 

ggctgagaca ggagaatcgt ttgaacctgg gaggcggagg tcgcagtgag ccaatatcac 1800 

gccattgcac tccagcctgg ctgacagagt gagactccat ctcaaaaaaa aaaaaaaaaa i860 

aagaatttta taaaatcagg aaataatatt agtgtttatg ttgaatttta actttagaat 192 0 

catagaaaac ttcctctggc atcattatta gacagctcfct gtgcagtggg tagcaccaga 1980 

cccagcttgc atggttattg atttttcaga gacacttttt gagcttattc tctggcagaa 2040 

agsggaactg cttcctcccc tatctcgtgt ctgcatacta gcttgtcttt acaagaagca 210 0 

gaagtagtgg aaatgtttat tcttgaaaat aagctttttg cttcacatga tctagaattt 2160 

ttaaaattag aaaaatgtgc ttactgcg 218 8 

<210> 19 

<211> 1183 

<212> DMA 

<213> Homo sapiens 

<220> 

<22l> misc_f eature 
<222> (1) . . . (11B3) 
<223> n a, t, c, or g 



<400> 19 
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agtaaaatgg agaattccaa 
atatcgacac ttacagataa 
acttaaagac aacataaa1:t 
ccaggaattc agactctgga 
gagttggact ntggattttt 
tgcaacttgc aggttttaag 
Ctcagtaaag agacctgatg 
agctgtattg atatgaagtt 
tcctcttgtg cttgtctctc 
gttacatttg acaagtctgt 
ccaagaagtt tctgagcttt 
acttcgttcc aacatggaca 
aaacttcaag cactaatgct 
tttgtagaaa agtacggctg 
agtgttctta gtgttctttc 
aagctgggtg tggctgagtt 
cctgcttttt: catgttttta 
tcttcatatt taaaaaaaaa 
tgtttaaaaa cccatcataa 
ttagatgaga ccacactatt 



attctgaaat tgttagaaca 
atageataaa tgctttctcc 
gcaaaatagt gaggatgttg 
ttcctgtttg ccaaatcatg 
ctttgcaaga gggacaagag 
tgtcctgtca ttgtgccttg 
cattggactg ttgcaatgga 
ctccaaaaga cttcaaggac 
tttgcatgaa atgcttccag 
gcaatggatc aaaatcagaa 
gtggcctacc aagggagaaa 
tcctgaagcc aatcctggtg 
ttcggaatgt gaggcttttc 
gctgggagtt tgtgatataa 
agacttttgg gccatctccc 
taagccaaaa gttttttgtg 
ctattataat actaagcaag 
aagtcttgaa accattgatg 
agatgacatt gtgggctgtc 
tagcttactt agtaataaca 



tagttctgtg tcttagttaa 
ccatatttca gcccagtcct 
ttcatctaat aaaagtggtt 
tgtcccactc ttaagaaaac 
tgtgggagat actgagttaa 
tgctttgata cattctgagt 
acctgtttta agatcttcaa 
ccagcttcca atcttcataa 
gtatttttgc aaggctacca 
gagatgattc aacttggtga 
ctggctgcag cagagcgagt 
agtagacttg ctcactggag 
cttggacagc atgactttgt 
tttagttcag tggtattcta 
aaagggtgaa tgggaagaat 
cttgtttcaa tcagagaaga 
agctcatttg aaaacagagt 
ggaagatgga tatctattta 
acagttggaa ggccctggaa 
ttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1183 



<210> 20 
<211> 8981 
<212> DNA 

<213> Homo sapiens 
<400> 20 

ccgtttggca aatgctcagt aaaagaaaag ggttagaagg ggagaaaggc attttatccc 60 

aagccttcag gaatcaggat gaggatgtct tcaccttgtg gtggggagta attatacaat 120 

tagagacagc acattggagt gtggctgata tgctgtgtga tgatagctct agctctctgc 180 

cfcagcagagg aaggacattt caatagaaga aaaagtttaa gaccttgccg agaaacagag 240 

aaaggatgtt tgtcttttta agaagttgaa aacccfcgttt gcagacaaaa gccctccagt 300 

tttggcagta aactttcatg caagggaaga aaaaggcagg ggatgacatt gttgacaatt 3 60 

gtgaggaatt accatgtgcc aggcactgtg cgaggggctt tgtacatatc ctctagtttt 420 

agtgcttata aaaactctgt gatatgtgca cagcatttta aactttgctg catagtcgag 480 

aaaatggaag gatggggaat ttgagtcatt tgcccagggt tctatagcta ccccaggttc 54 0 

ccafcgactgg agaattgggg cacagggtgg cgggggagag tgagtgacaa gaatcctaac 600 

aatcttattt ccattgagtc cttataaaag aagtggatta actaccacgt: ttttaagttt 6 60 

ttcttaaatt taggttatgt ggatctggcg tttcctgttt tgtcctgggt ttgttttgtt 720 

tfetgctatgc tgtcttgaac atctgtcatc ttgtaggcct aacggtaaac aciaaaaacac 7 80 

tttacctcct atagctttca attaagatct ctcagtttgt gtttgtaata gttttccagg 840 

caagttctcc ctaggttcgg cttctagtgt gttaaccttt agttataaag tgaacccaaa 900 

gagagaaagt agaaacaaaa cacctcacct gtttttgctc atgaatfcact ctctatggaa 960 

ggaacaatca tgaacacctc tgcgtatcac agaggcctat ctgagtctga cgtttaaggg 1020 

agaccgcgta ggtccctttg aggactgtga atgtgggagt cctgggactc tggtgaagaa 10 BO 

cccgttccag aagagatgaa tgagctggac aagttctttc atagaacctt taggcaggtt 1140 

ttcttagaaa tgcacattga ggattatgct tggatattgt gatgatcaga atgatactca 1200 

atcccttctg catttggaat tctctttgaa agaaaacatc ccaggcagct atttctcaga 12 60 

gatagtgagt cccagccact tctagacatt ttcttgtgta gtctacatta taatttcaca 13 20 

gcagtctctg atatgacaaa tgtcaaaata gcccaacctt ctctaaactt cagagatgtc 13 80 

tgatatgata ttgaataaaa caatgctcat agaaacatca agaaaggtgg attttccctg 1440 

gatacttttt tcctgcttga caaataacag tgaagaaact gatctcacgt ctttttctct 1500 

ttggaagcct gaacactcag aacccaactt gaggctcctc agctatagca attctgactt 1560 

cacagtctgt aaattattgt tctttttttt ctttagctta tgctttctgc cctaatttat 1620 

cttttccctg ttctaatgaa ttattgtcct atatctgctg tgcagttagg tgacatataa 1680 
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cagcaattaa atatatgaat tggtacatat aaagatttga ctaaaactcg atgtaaaaat 1740 

aagtgttcta cattcaattt ccagtgttag aaacagtgct gacttgaaca gagtgacaga 18 00 

attccatctt tccctatttt tgacagcttt aaactttata ttttcttcct ttcttgtgag i860 

ccgtcattaa cttgtttctc aaagccattc ccgtattacc catcttgcag acgcagacag 1920 

atttgggaat ttgcggtcag agttgtattg gacacatccc cccagcccac atgagatcct 1980 

tttaatctat tgcatattaa ctagttttaa gtacaatatt cctacttcat ttaaaaccat 2040 

taatcaaaga atgagtttga aaatgaacaa aatgcaaact tacagttaga aataattgta 2100 

gtgtctttag ttttggttag gagtcggttt cttgtttgtt aaactcaaga ttgtgaacag 2160 

ttttaattca cttgtttatt tccaatagag atttcaggtt tacatttgaa ttcagaaaca 222 0 

aagttttctt tctcattaca gagaacacta aactctacat ctcccttccc gagcaaggag 2280 

ctggccgaag ccacaaaaac attgctgcat agtcttggga ctctggccca ggaggtaagt 2 340 

tgtgtctttc cagtaccagg aagcggatca tccactgtat cagtattttc attcctgagt 2400 

ctggcaagag gtccttttga gttgaatatc acatgggatg taatatcaat tttcaaagta 24 60 

taagtgatgt aaacaataat gttttgattt ccttatttta gaaatgaaga aacctaaaac 2520 

tcatagatgt ctcagagcta attggttagt ggctaacagc tggatatcta gtttagaacc 2580 

ttctccattt tttctttttg cccctaggta atcatacatt tgtaaagagg agaattatct 2640 

ctgccactgc ccatgcactg cttttgtctg accagcaatt tctccatatt gcttcttcag 2700 

tagcaaggcc aatcatttta ccaacacaca tgcttgctaa ctaacaggaa taacgtggta 2760 

cccctaattc agccctttcc cttgaaagca tctggcttct gaggttcaac tatgggaata 2820 

tggtctctta atgaaeatta agttgagttt gccttttagg tccacatgtt gacaaatgta 2880 

tcagagtaat ctctgtccta ggatcagagg gcctgtaggc acttgcaaaa gcagttagct 2940 

ctgactccca gccagtgcac actccacctt tctgactccc agcjcttgtct caaattaggc 3000 

ttggaagcga ggaactgtct ggtgtccccc agcataggaa gctgagccag ggggcagtgc 3060 

tcacaaacaa tacagacttt aacgtgtagg atattggaaa ataataattt gtggggaaat 3120 

tgtctcagac ttggtccacc cttattttta gctgcttctc taatccgttt ttcttttttt 3180 

ggtgcttgta tctaacctac ccattttttg gtgcttgcat cattttttca aatatcaaaa 3240 

acgaacttta tgttttctaa caatgaaagt attgcatgtt cattgtggaa aatgctgaag 3300 

acttggaaaa tacaaaaatg ctgagatcaa acactattga tacgttagtg tatttcttcc 3360 

tgtcctgttc tactttcttt ctttgaattc tgctcacgtg tttctgactg atgaggtctg 3420 

acttttgggt tccttttcca gaggagaagc cttctttcag cttgccattt gttaccctgg 34 80 

ttatgaaggc tggtaacctt ttttactagg tagagaagct ggaccaactg gggttcttcc 3540 

agggggagaa tgagaaagag aaactgtttt gcaagtccgt agctatttct ctagggccct 3 600 

gttagctgac attgacatgc cttgcattgc tctgcagatc ccctcgcagc cctctgtccc 3660 

ttgttcattt ctggccttag agaaagcaaa gcagggtctg taacagggga ggctgcctct 3720 

aaactcaggg tttggttaca gctgttttca cttacarcac tggccctggt tttttttttt 3780 

tttctggcat taaaaaaaaa aattggaagc aggtgatgtt cccattgctg afcgtggtgga 3840 

aactctccaa gtgaacaata tacgtttttc ttggcagctg tttcttgtgc cctgcttgct 3900 

cctggtccag gacaagcaag gaccatctgc ctctttcaat agaacacctc cagatccctt 3 960 

tgatcaaaag ttactcattg tctgacttgc tatttctgtg agataaatgg gagaagatca 4020 

ataaatgcac ttgtttgtcc agtcagcgtg tggaaagttg ataattttga ccaaagcaca 4 0 80 

accctgaaag gaaaagaaaa agggagtgaa tgtcttctga gaagctgcct aggttcagac 4140 

agtgtcaccc atttccctgt atgctccaca tgacaaacet gagtgggtct catcatgtcc 42 00 

attttgcaga tggcaccaag gctcagaaag gttaggcaac ttttccagtc acccaatgag 42 60 

ttaattgaca aaactgggat tcaaacccag aactgttgga ttccaaagcc tgtgttgttg 4320 

cctgcttcgt gaaaaactcc agfcagcgact ggaatagaaa ggagaacctt ccaagaaaga 4380 

aaatacgcac tagcagaacc tggaaattgg gaggaaatga ggacttgagg aataagatga 4440 

atgaaagctg acctgagttt cacatctggg tgatgggaag ggaggacagg gaggcagcat 4500 

ctcagatgtc cacccagcac cgaccagctg cctggcattg ctaggtgttg aggactcagc 4560 

agtgaacacg ctaacttctc tgctttcttg gggcacgtat agggtgagag acagaaacaa 4620 

acaggtcagt gtacaatgcc acaggaggga tatatgcagt gaagaaaaag cagggtaagg 4680 

ggcatagagc atgagaaggt gcttttttta aaggggktga ttaggaaagc tctctctaag 4740 

gtgacagttg gacctgaagg agatgatagc atgtctgtgg tgagggaagg aaactccgaa 4800 

caggaagaat ggcagataca aagacattga tgctagagca tgcctaagga atgtgtttaa 4860 

ggaccaggga aagtgagcaa gtggtggggg gaggagagga gctcagagca ggaggaggtg 4920 

agtgcjcatac aggcctggca agactttgga ttcctgctgg gtgagatgag aatccagcgg 4980 
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agggcttgag ggaggggaca tgatgtgatc tagagtttag actgtttaca ctctggttgt 5040 

tgggttgaga agagactggg atgggggaaa gggaggacaa aggacattgt gctggattga 5100 

gaaagcagta agtcagtttc attcattcac tcaaccgatg atgttcaaat accaccatca 5 ISO 

tccgtgggct aaaggatgaa gagccatccc tccctgagag tcaggaagca cttcccagat S22 0 

aaagtttgga gtgtgagctg aggtgtagga gaaagagtaa gagtttaccc ctgaaacggg 5280 

tgctgggaag agtcaatagt ttggaataac tcaataattt atggtgcttc tttagaaaga 534 0 

tttgctggct ttatgtggga agaaatttkt ttttttgatt ggggagtggt gggttggtgg 540 0 

tgaggctgcc tgtggaaaga gaagtgagtg ttttgactca ctgttattta aaaatctcta 54S0 

gggctgttcc aataagcaac aaaaggcaaa atggcctggt tctctgtccc ctttctgtct 5520 

gtatgcctcg tacaggttat gaaaagaaaa agttgggaaa agctgtccac ctcacctaat 5530 

tgtgttcttg tggagtgtgc tagatgcccc ctctctggag aaaaaaaatc cttgtggcct 564 0 

ctgacccacc tctggagagc ctagttccct tctggaggca gaaggcaaag cttaggacct 570 o 

agagagtgct ggaccacgcc actcacagga accagcaggc tgtgaggttg aaagctaggc 5760 

atatggagct ttccaggctg ggtgcagggc ctcgtggccc ttcccctccc ctctgtgctc 582 0 

tatagctcag tcttcccagg cggtgtgaac acgcagtgac atttccagga atacagggat 588 0 

ttattaatga tttcttgtga aatgtttgga aatacaaagt actctataaa tatttcataa 594 0 

tagcattggg gctgagaact ccacaaagtg ccggaataca tttgcatgta agacagaacg 6000 

ctgcctgggt cattgatgcc tgttgagtgg cagtcacaga cactgcctag ggtttctgac 6060 

tcacgctgtt gggactgttc tatgcagggc accctcttgt gtggcatagg atttgtgcct 612 0 

caccacacac tgttgtagct ttgctgtctt gatgatgagt agagggcagt gtccaggcca 618 0 

tggtataagc atctactgcc ccccagggtt accaaaacca agccaagttg tgtctcagcg 624 0 

agctccgtga agcatggaga agttgagtac tcagagacat gacgtgactt ttcaaaggct 630 0 

gtaagctgac gagggacata gctagggttc agacttgagt ttttcttttt ctttttcttt 6360 

ttcttttttt tttaagactg agtcttgctt ttgtcgccca ggctggattg cagtggtgct 6420 

tggctcactg caacctctgc ctcccgggtt caagcaattc tcctgcctca gcctccccag 6480 

tagctgggat t^caggcacc tgccaccatg cctggccaac atttttgtat ttttttagta 6540 

gagatggggt ttcaccatgt tggccaggct ggtcttgaac tcctgacctc aggtgatcca 6600 

cccgcctcga cctcccaaag tactgggatt acaggtgtga gccactgcac ccggcccaga 6660 

ctcgagtttt tcatcttaat gctttttcat tgcctgacac tttactgaga ccaagatagg 672 0 

gaacttcaca tacagtacct tttctcccaa ggcggaagag ggctgttcaa tttctacact 6780 

agagttcggg gagttttaga aatgagtcag ttatcgagga tgagagcagt tcctgatagg 684 0 

ctcaaccaca atgagatgta gctgttcaga gaaagcattc ttttatctat aaactggaag 6900 

ataatcccgg tgaaacgaag cccagcccca ggggcttcac taactccagg ctgtgcttct 6960 

caaactttag tgagcatagg aatcacctgg gcatcttgtg aagctgtaga tttgaattct 702 0 

gcaggtcggc agaggggtct cagaatccgc atttccaaca atgtctccag taatgctgat 708 0 

gctgctcgtc cctggaccac agattgggta gccaggttct ggcaagctca tcccaaggct 714 0 

ttgagatgac atcagacaaa atatgttctg ggacatggct tttgagaggt caagaaaata 7200 

agatgtttct ttctcttctc atccccaacc cttgcactgc ccttttctcc cttcccctac 726 0 

cctcctttct gtccccatcc ctgacgccag ctgttcagca tgagaagctg gagtgacatg 732 0 

cgacaggagg tgatgtttct gaccaatgtg aacagctcca gctcctccac ccaaatctac 73 8 0 

caggctgtgt ctcgtattgt ctgcgggcat cccgagggag gggggctgaa gatcaagtct 744 0 

ctcaactggt atgaggacaa caactacaaa gccctctttg gaggcaatgg cactgaggaa 75 0 0 

gatgctgaaa ccttctatga caactctaca agtgagtgtc catgcagacc ccagccctgt 756 0 

ccccaacGcc atccctccct tagttctggc cttggcctgt gtcatctcct ccctctgtag 7620 

cagcgttaga tgtctacatg cccatttgcc caccagactg agctcttcct agaggagaga 768 0 

ggcttctctt gaatagctac ctgtccccag ttctctgaat gcagcctggc acatctcagg 7740 

tgcacagtag tgtttatcaa tggaatgaat gattgacagc caaccttctg gttttctggg 78 00 

ggatgtggaa gggtggcttc cagggtgatc aagaatgaga taatggcaga aggacaaatc 7860 

ctgcaagatc tcacttatat atggaatata tgtaaggtag aaagtgtcag tttcacatga 792 0 

tgaataagtt cctgggatct tgatgtacat cgtgatgact atagttagta acactgtata 798 0 

gtatacttga aatttgctaa gagagtagat ccgaagtgtt cacactacac aaaaaaggca 804 0 

actatgaggt gatggattta ttaacagctt gattgtggtg atccttttac aaagtataca 8100 

tatattaaaa catcacattg tataccttaa atatatacaa tttttatttg tcagttgtaa 816 0 

ctcaaaaaag ctagaaaagc atttttaaaa aggatgatgt actggtctta atattaccat 822 0 

tgagataagc tttataataa cataaaaaga aataacagta atgataatag caacaacaac 82 80 
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aacaacaaag aactaacatt taagtagaat ttcttgtgca ctgtgcattc tgtttaagtt 8340 

atctcatttt accctcatga taacctgcag ggaagattct ttaaccccac atttcatagg 84 00 

ctcagagagg ttaagtgcct tggttagagc cacatcagag ttaatccaca agagccagga 8460 

ttcaagccca aatctgcctg gatctgtgct ctctaagata actgttagtg gtggcgtgtg 8520 

tgttctcaca ctcagacatt tgatctgccc tttgtttccc attcttagct gcaaggcagt 8580 

gttaaagaac cctgtgtctc catatccact ccccacactt aagcactttt gtgggcccgt S640 

gtgccgtatg cctcgtggca gcagggatcc aatgtcacag ttttaggcag tggcatcctt 8700 

ttccttgaaa acttgatgca ggggaacctt tctccatttc caaccacagg tgtgtctttc 8760 

agacactgag tgaggcaggt tttgtacttt attgtaacac aagaaccttt tcttctctgg 8 820 

agtaaagcac tccagacatt cgcaagttgc tttacaagcc ttaaaaggat ggtattgtag 8 880 

gcaactttaa ttaaatccca tctcctcctc tcccccagct tgcaagttga cccaaggaag 8 94 0 

ccttcatttc catgacagac ttaattgtga gggcatcctc a 8981 



<210> 21 

<211> 20284 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> Tnisc_f eature 
<222> (1) ... (20284) 
<223> n « a, t, c, or g 

<400> 21 

actgtgttag caaggatggt ctcgatctcc tgacctcgtg atccgcctgt atcggcctcc 6 0 

caaagtgctg ggattacagg cgtgaaccac tgcgccctgt tgagaatttt tttttttttt 12 0 

tttgggagaa agagtttcgc tcttgttgcc cgggctagag tgcagtgaca caatctcggc 18 0 

tcactgcaac ctctgcctcc tgggttcaag caalttctcct gcctcagcct catgcgtcac 24 0 

cacgcccagc taattttgta tttttagtag agacagggtt tctccatgtt ggtcaggctg 300 

gtctcgaact cccaacctca ggtggttcgc ccgccttggc ctcccaaagt gctgggattg 3 60 

caggcatgag ccactgcgcc cagccccaaa ttttggtttt tgcttgaaaa ctgaggtctg 420 

aattcagcct tctggttgcc cctcaagagt cagtttaaat gttggtcatg ttagttgtca 480 

gtgaaaacaa tggtgaggct ggcatgagag tgtgaatctg gatigggaggg cttgtgcttc 54 0 

atgaaaacat ttttcnagat cagctcagtc gtgagttatc cgtcattgac gttataataa 6O0 

gctctgatta tttatcaagc atcattcttt atagatatct cagtttaatc tgagataatc 66 0 

ttctccacat ctctccacat agatgttatg aattttactt ttacagagga gccaactgag 72 0 

gctcagataa gttacttatt atatgactag tagtggtaga gctggggttt caactaagaa 780 

ctctctggct ccaaagccct tgtaagtttc tatcagtata tgaccatgca tatgagcatt 84 0 

tgtctctcct cttcttcata gctccttact gcaatgattt gatgaagaat ttggagtcta 90 0 

gtcctctttc ccgcattatc tggaaagctc tgaagccgct gctcgttggg aagatcctgt 96 0 

atacacctga cactccagcc acaaggcagg tcatggctga ggtaagctgc ccccagccca 102 0 

agactccctc cccagaatct ccccagaact gggggcaaaa aactcaaggt agcttcagag 108 0 

gtgtgcgcta agtatactca cggctcttct ggaattccca gagtgaaaac ctcaagtctg 114 0 

atgcagacca gagctgggcc agctccccag tcgtgggtat agaatcatag ttacaagcag 1200 

gcatttcttg gggatgggga ggactggcac agggctgctg tgatggggta tcttttcagg 1260 

gaggagccaa acgctcattg tctgtgcttc tcctcctttt tctgcggtcc ctggctcccc 1320 

acctgactcc aggtgaacaa gaccttccag gaactggctg tgttccatga tctggaaggc 13 80 

atgtgggagg aactcagccc caagatctgg accttcatgg agaacagcca agaaatggac 144 0 

cttgtccggg tgagfcgtccc tcccattatt accatgtgcc tgcttgatac tggagaggtg 1500 

agtttctggt cactttccca ggtgtgagtg aggtgagaat tctttcagtt tatctagctg 1560 

ggggaatgta gtgagcatag ctaaagtcac agggcaccac ctctccagaa gtacaggcca 162 0 

tggtgcagag ataacgctgt gcatatcagc atccatgcca ctcacggtca aatagcagtt 1680 

ttctgcaaaa cttagtgagg gctggtgttt ggaagtggag ttgagtaatt gcagtaccct 1740 
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attttccttt ttgctgcagc ctctcagcca gccacagcat ctccctgtgt cttggtaggt 1800 

tttggaaaga agtgtgggag caaaagcatg atgttacatg tagactggcc tgagatactc 1860 

attctcaggg cactgtgtga atgatgagct gctgttactg tgtggagggg aaatgcactt 1920 

agtgcttcag agccacttga aagggataag tgctctagag acaattgggt tcaaatgtgg 1980 

agcaggctga gcaagaacag aatgtctcct ttgcctgagc ctgagtgctg ttaatcacat 2040 

cttcctgcct tgggctgagt tagagaatca ttagactatt tcctgtttcc atggtgaggg 2100 

aggcctcttc cttttgtctc tgctcccctt aagaagcagg tgaggatttt gccaggtttc 2160 

ttgttttgaa ccttattgac tttaagggcg gctgggtttt agagactgta cctacctagg 2220 

gggaacactt ccgaagttta ggactattcc ctgatccgct gggaggcagg ttactgagga 22 80 

agtcccttta aaaacaaagg agtttatact gagaaaagca taaacagtga tttgtatgga 2340 

ttcacactga ctaatatagc tcatgccatt aaagtggggt ctcttctcta aaggagggtt 2400 

atatgatcta gccccgtaga cctaagtgtg gttfccagacc tgttcttcct ggtcctctcc 24 60 

ttggaatcca tatttctact agttggactt tttctgtttg tctggctctc agaggattat 2S20 

aggaggccct gtgaagtgac tqagtgaatt ttgatttgtg ggcaagtaga tggttcccta 2580 

gtctgaaatt gactttgcct taggtgcttc aattcttcat aagctcccag ttcttaaagg 2 640 

acaagatcct tgtaaacatg gcaatggcat tcattaggaa tctagctggg aaaatccagt 2700 

gtgtatgctt ggaaatgagg gatctggggc tggagagaaa ggcatgggca tgccttggag 2760 

ggacttgfcgt gtcaagctga ggacctttac tttaagctct aggggaccag gcaaggggag 2820 

atgtagatac gttactctga tggggtggat gaattgaaga aggatgaggc aagaatgaag 2880 

gcagagacca gggaggaggc tctccaagtg gccaaggcat aaagcaagaa atgaggcctg 2940 

gtgactgctt agtggcagag cagtgaaaga gagggaggca tcaaagtgag tctcgatttc 3 000 

tagctgggtg ggtggtagcg atgtccagta ggccagtggc tactgaggtc tgcagtggag 3 060 

gagggtggtt gggctggaga cagatgatga gggagtcatc agcctgtggg tggaagaaaa 312 0 

gggaacctct tccaactgtt ttctttgctt cttccctctc tttctctttt tttttttttt 3180 

tggacagagt cttgctctgt cacccaggct gaaatgcagt ggcatgatct tggctcacca 3240 

cagcctccgc ctcctgggtt caagcaattc tcctgtctca gcctccagag tagctgggat 3300 

tacaggcaca tatcactgtg cccggctaat ttttgtattt tcagtggaga tgggatttca 3360 

ccatgttggt cgggctggaa tgaactcctg acctcaagtg atccacctgc ctcagcctcc 342 0 

caaagtgttg ggattacagg catgagccac cgcgcccggc ctttcttccc tctcttaaag 3480 

agtgtttatt taattccaca aacatgagct tgtcaccccc tgtagcctgg catctcctac 3S4 0 

acgaggtgat ggctgaggct tctgcttctg ctggggtagc tctgatcttt ctgctttctc 3600 

tggcactgtc tacccatgtt gcctcacccc acaggtccca gggcacctct ctcgggcaag 3660 

tcttggaacc ctctzgacact gatttgctct cttttctgag ctgcttttag ccacccatcc 3720 

tcgggacctg ttttctctct gcccccaccc ctgcgggcag tcttaggtct cctgcccctc 3780 

acgagcaccc cagagaggcc acgtgctcag tgatctcagt gggcgcatct ttctagtctt 3840 

gctattcttt ttggccatgt tgttcagaaa ccatactggg cagggccgac ttcaccctaa 3900 

aggctgcgtc tcttcactct gcttttgttt gttccaaata aagtggcttc agaattgcta 3 960 

accctagcct ctgtgaactt gtgaggtaca attttgtgtc tgttatgtta acaaaaatac 4 02 0 

atacatacct tcctggtgat ggtataaatt gctattctct attggaaagc aatttggaat 4 0 80 

gaaaatttaa agaaccattt taaaatatgc tatcctgcgt acctccattc cacccacccc 414 0 

cagggatgta gcctactgaa ataattttaa agaagtcacc atatgagaga aaatgttatt 4200 

gctatattgt tattgtgaga aattggaaat agactaaatg ttcagcacta taggaataat 426 0 

taatgaaatt acatatactc tatacaatca ttatgctgcc attgaaataa taaatacaaa 432 0 

ggcgcaaggg gggaaaagct tataatgtta gtgaaactaa gactgatttt tttataaagc 4380 

agcagttttc agacccttgg agactccaat tcggtagaac cagagcttca tcttctctgt 4440 

cgaagctgtg acaggagttg caaatgcctc tcctttttgc tgagtttgca gctgctgttt 4 500 

ttccggcagc acatctgtgc aggcctctgc ctcggcccct ctggatctgc tgattgagca 4560 

gcggattgat ctgtGctt:ct ctttcgtgtt gacccatgtg aggaaccaac tggcaaggga 4 62 0 

acaagaaatg gaaataggcc tcctttgcat catgacctgt acatcctgca attggaaaag 4680 

attgtacttt agttggttta accagcagca ttatttttct aaactaagca gtaagaagga 4740 

attaggtttt atgtgggatc aacagactgg gtctcaaaag aggaaggtga tagaacacag 4800 

tggggagggg gaggtgcact agaaacagag ggcctatgct ttcattctgg ctttgctact 4860 

taatagctgt gtgacccaat cttagagact taacctctct gaacttccat tttctcatgt 4920 

ataaaatggg aaatattaaa ggatactcac tgggctggtg gcttgtgcct gtaatcccag 4980 

cacttgggga ggttgaggtg ggaggatcac ttgagcccag gtgttcaaga ccagcccagg 5040 
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caacatggca agactctgtc tctatgaaaa aattaaaaat tagccaggtg tggtggtgtg 51Q0 

cacctgtagt cttagctact tggtaggctg agatgggagg atcacttiggg cttgggaggt 5160 

caaggctgcg gtgagctgtg attccatcac tgcactccag cccgggcggc agagcgagac 522 0 

actgaatcca aacgacaaca acaacaaaag gcaaaaaaat aaaagtgccc tctttatgga 52 80 

gttgtgtaag gtgaagcata tacactattc aacatagtaa ctatataaag gaagtattgt 5340 

tgttgttact gtagttaata ccattaagtg agatgtttcg tatagtggaa agcacatgga 5400 

ctctgaattc agactggtct gactttgagt ctcagctcca catctagtaa tactatgacc 5460 

aagccctggt taaaatcatg tttttttttc ttcagcctca gtcttctcac atataaaata 552 0 

gggacactgt catttacctc agttttctgt gaggataaaa caacgacagt gtatatgcaa 558 0 

gtattttgta aattttgtag tgctcctcaa gatttagttg gtgtttacta attgtacttt 564 0 

ctcactggaa tggcagatgc tgttggacag cagggacaat gaccactttt gggaacagca 5700 

gttggatggc ttagattgga cagcccaaga catcgtggcg tttttggcca agcacccaga 5760 

ggatgtccag tccagtaatg gttctgtgta cacctggaga ' gaagctttca acgagactaa 5820 

ccaggcaatc cggaccatat ctcgcttcat ggaggtgaat ctgttgctgg gatcatttag 5880 

aaaagactta acggcttctt tctctgagac gttacaataa ggttcaggca ggaggcaagt 5940 

ttagaaataa tgtatagtct catttacaaa actatccctc aagcctaaca caggatttga 6000 

taacaaaagg cacfctaataa atgttagttg agtggttgaa tgagtaaata aactctagct 6 06 0 

ttagtaaatt aactctagct tattctatat aggctcaaga gaatatttct acccattttc 612 0 

ttctaggttt tcctatctca gtgactaatg gtagcaaagc attcccttaa aaaggcatta 6180 

tttgtgaaac ttayctaaaa tcgaattcgg gtccaattaa atttttgaaa ttttatatta 624 0 

aaaattatat tagtagggat gggtaagagg tgttttggtc tggttggttg gttagttgct 63 0 0 

atgactcaga attgctaaga aaacagaaaa gtaagataag atcattgttt taacctcttt 63 60 

tcctccacaa aatcaataaa taacatatcc ctaaattact cttagaattt ctcttaaatt 6420 

gcagtgaaaa accaaaatcc ttcattcttg gttgaaggtt ggaaaactac gttagagagg 6480 

attagagaga gaggatgagc aatcgtgtag tcagcccttg cctcctagtg taggatttgt 654 0 

ctcagccact gcttgttgtc ctggctgcca acgttctcat gaaggctgtt cttctatcag 6600 

tgtgtcaacc tgaacaagct agaacccata gcaacagaag tctggctcat caacaagtcc 6660 

atggagctgc tggatgagag gaagttctgg gctggtattg tgttcactgg aattactcctn 67 2 0 

■ rgcagcattg agctgcccca tcatgtcaag tacaagatcc gaatggacat tgacaatgtg 678 0 

gagaggacaa ataaaatcaa ggatgggtaa gtggaatccc atcacaccag cctggtcttg 6840 

gggaggtcca gagcacctat tatattagga caagaggtac tttattttaa ctaaaaattt 6900 

ggtagaaatt tcaacaacaa caaaaaaact caacttggtg tcatgatttt ggtgaaattg 6 960 

gtacatgact tgctggaagg tttttcatag gtcataaaat aacagtatct tttgatttag 702 0 

catttctact caagggaatt aattccagga attttggtgg caggcacctg taatcccagc 7080 

tactcgggag gctgaggcag gagaattgct tgaacccagg aggcagaggt tgcagtgagc 7140 

taagatcgca tcattgcact cccgcctggg caataagagt gaaactccat ctcaaaaaaa 72 00 

aaaaagatac aaaaatagaa aaaggggctt ggtaagggta gtagggtttt gggcaatttt 7260 

tttttttttt ttttttttta ttgtatggtt ctaaaggaat ggttgattac ctgtggtttg 7320 

gttttaggta ctgggaccct ggtcctcgag ctgacccctt tgaggacatg cggtacgtct 7380 

gggggggctt cgcctacttg caggatgtgg tggagcaggc aatcatcagg gtgctgacgg 7440 

gcaccgagaa gaaaactggt gtctatatgc aacagatgcc ctatccctgt tacgttgatg 7500 

acatgtaagt tacctgcaag ccactgfcttt taaccagttt atactgtgcc agatgggggt 7560 

gtatatatgt gtgtgcatgc gcatgcatgt gtgaatgatc tggaaataag atgccagatg 7620 

taagttgtca acagttgcag ccacatgaca gacatagata tatgtgcaca cactagtaaa 7 680 

cctctttcct tctcatccat ggttgccact tttatctttt tatttttatt tttttttttg 7740 

agatggagtc tcgctctgac gcccaggctg gagtgcagtg gctcgatctc ggctcactgc 7 600 

aaccttfcgcc tcccgggttc aagctattct cctgcctcag cctccacagt agctgggact 7860 

acaggctcat gctgccacgc ccggctgact ttttgtattt tagtagagac gaggtttcac 7920 

catgttaccc aggctagact tcaactcctg agctcaggca atccaccctc cttggcctcc 7980 

caaagtgctg ggattacagg tgtgagccac tgcacccagc ccaccacttt aattttttac 8040 

actctaccct tttggtcaaa atttgctcaa tctgcaagct taaaatgtgt catgacaaac 8100 

acatgcaagc acatactcac acatagatgc agaaacagcg tctaaactta taaaagcaca 8160 

gtttatgtaa atgtgtgcac ttcttctccc taggtggtaa accacatttc aaaacaaccc 8220 

aaataaaact gaacaaagct tcttcctctt agacttttta gaaaatcttt cagtgctgag 82 80 

tcactaagct gccaagttct cattgtggga actatgcctt tggatgtaat gatttcttct 8340 
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aagacaatgg gcggaggtgt agttattgca gacatctgaa atatgtaatg tttcttccag 84 00 

attctggaaa ttctettatt ctctgtggtt ggtggtggtg gtgggatgtg tgtgtgtgtg 8460 

tgtgtgtgtg tgtgtgtgtg tgtgtaggga tcaggatgcg ggaggagctg ggttctgctt 8520 

gtattggttc tctgttttgc attgaatagt gtgtttcctt gtatggctat ctatagcttt 85 80 

tcaaggtcac cagaaattat cctgtttttc accttctaaa caattagctg gaatttttca 8640 

aaggaagact tttacaaaga cccctaagct aaggtttact ctagaaagga tgtcttaaga 8700 

cagggcacag gagttcagag gcattaagag ctggtgcctg ttgtcatgta gtgagtatgt 8760 

gcctacatgg taaagctttg acgtgaacct caagttcagg gtccaaaatc tgtgtgcctt 882 0 

tttactttgc acatctgcat tttctatfcct agcttggaat ctgaaacatn gacaagagct 8 880 

gcctgaaatg tatgtctgtg gtgtgattag agttacgata agcaagtcaa tagtgagatg 894 0 

accttggaga tgttgaactt ttgtgagaga atgagttgtt tttttgtttt ggtttttagt 9000 

actttaacat aatctacctt tagtttaagt atcgctcaca gttacctagt tactgaagca 9060 

agcccccaaa gaaatttggt ttggcaacac tttgttagcc tcgtttttct ctctacattg 912 0 

cattgctcgt gaagcattgg atcatacgta catttcagag tctagagggc ctgtccttct 9180 

gtggcccaga tgtggtgctc cctctagcat gcaggctcag aggccttggc ccatcaccct 9240 

ggctcacgtg tgtctttctt tctccccttg tccttccttg gggcctccag ctttctgcgg 93 0 0 

gtgatgagcc ggtcaatgcc cctcttcatg acgctggcct ggatttactc agtggctgtg 9360 

atcatcaagg gcatcgtgta tgagaaggag gcacggctga aagagaccat gcggatcatg 94 20 

ggcctggaca acagcatcct ctggtttagc tggttcatta gtagcctcat tcctcttctt 9480 

gtgagcgctg gcctgctagt ggtcatcctg aaggtaaggc agcctcactc gctcttccct 954 0 

gceaggaaac tccgaaatag ctcaacacgg gctaagggag gagaagaaga aaaaaaatcc 9600 

aagcctctgg tagagaaggg gtcatacctg tcatttcctg caatttcatc catttatagt 9660 

tggggaaagt gaggcccaga gaggggcagt gacttgccca aggtcaaccc agccgggtag 972 0 

cagctaagta ggatgagagt gcagggttca tgctttccag ataaccacat gctcaactgt 9780 

gccatgctgt ctcattggta gtggttcatg gcagcatctg aaagctattt attttcttag 9840 

atatattggg tggcgattct tcctaagttt ctaagaacaa taatcagaag gatatatatt 9900 

gttgcaggtt agactgtctg gaagcagagg ctgaaataga gtttgatgta tgggtattta 9960 

tgagggctca atacctatgg aagagatatg gaagatgcag gattgggcag agggaggagt 10020 

tgaactgtga tatagggcca accccgtggg gcactctaga gaatatgcag cttgttggag 10080 

ttgttcttca tcgagctgaa acatccagcc ctttgtgctc ccccaaggcc tcectcctga 10140 

caccacctac ctcagccctc tcaatcaatc actggatgtg ggctgccctg ggaaggtcgt 10200 

gccccagggc ctacatggct ctctgctgct gtgacaaacc cagagttgct gatgcctgag 10260 

gccgtctact gacagctggg caacaaggct tccctgaatg gggactctgg gcagtgcagt 1032 0 

tttgtgtctg aaccatacat taatatattt atatccgaat tttctttctc tgcaagcatt 10 3 80 

tcatataaag acacatcagg taaaaataaa tgtttttgaa gcaaaaggag tacaaagaga 10440 

taagaactaa ctaatttaat actagttacc atctgttaca aatagttcct actgattgcc 10500 

aaggactgtt taaacacatc acatgggctt cttcttctat cctcactaac ccttttaaca 1056 0 

gacaaggaaa tgaggctcag gaaggtcaag gactttattg aggttccaca gtaggataca 1062 0 

gttcttgcta aaagcaaccc ctccctcatg ctctgttatc taactgcaag gggaaggtca 10680 

gtggcagagg tagtggtccc atggttggtg cataagagct gctctgagac aactgcatgc 1074 0 

tggtgggtcc tgcagacatg tacccatcag ccggagatag gctcaaaata tccacaagag 10800 

tttggatgat tgtgggaatg cagaatccat ggtgatcaag agggaaagtc aagttgcctg 1086 0 

gccattttcc ttggctttta gacagaaaag ttacgtggga tattatctcc cacagctctt 1092 0 

Gtgtggtgcc accagtcata gtccttatat aaggagaaac cagttgaaat tacctattga 10 98 0 

agaaacaaag agcaaactcg cccactgaaa tgcgtagaaa gccctggact ctgttgtatt 11040 

cataactctg ccattatttt tctgcgtagt tttgggtaag tcacttatct tctttaggat 1110 0 

ggtaatgatc agttgcctca tcagaaagat gaacagcatt acgcctctgc attgtctcta 11160 

acatgagtag gaataaacac tgtctttttt ctgtagatca tacaagtgag tgcttgggat 1122 0 

tgttgaggca gcacatttga tgtgtctctt ccttcccagt taggaaacct gctgccctac 112 80 

agtgatccca gcgtggtgtt tgtcttcctg tccgtgtttg ctgtggtgac aatcctgcag 11340 

tgcttcctga ttagcacact cttctccaga gccaacctgg cagcagcctg tgggggcatc 114 0 0 

atctacttca 'cgctgtacct gccctacgtc ctgtgtgtgg catggcagga ctacgtgggc 11460 

ttcacactca agatcttcgc tgtgagtacc tctggccttt cttcagtggc tgtaggcatt 11S2 0 

tgaccttcct ttggagtccc tgaataaaag cagcaagttg agaacagaag atgattgtct 11580 

tttGcaatgg gacatgaacc ttagctctag attctaagct ctttaagggt a^gggcaagc 11640 
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attgtgtttt attaaattgt ttacctttag tcttctcagt gaatcctggt tgaattgaat 11700 

tgaatggaat ttttccgaga gccagactgc atcttgaact gggctgggga taaatggcat 11760 

tgaggaatgg cttcaggcaa cagatgccat ctctgccctt tatctcccag ctctgttggc 11820 

tatgttaagc tcatgacaaa ccaaggccac aaatagaact: gaaaactctt gatgtcagag 11880 

atgacctctc ttgtcttcct tgtgtccagt atggtgtttt gcttgagtaa tgttttctga 1194 0 

actaagcaca actgaggagc aggtgcctca tcccacaaat tcctgacttg gacacttcct 12000 

tccctcgtac agagcagggg gatatcttgg agagtgtgtg agcccctaca agtgcaagtt 12060 

gtcagatgtc cccaggtcac ttatcaggaa agctaagagt gactcatagg atgctcctgt 12120 

tgcctcagtc tgggcttcat aggcatcagc agccccaaac aggcacctct gatcctgagc 12180 

catccttggc tgagcaggga gcctcagaag actgtgggta tgcgcatgtg tgtgggggaa 1224 0 

caggattgct gagccttggg gcatctttgg aaacataaag ttttaaaagt tttatgcttc 123 0 0 

actgtafcatg catttctgaa atgtttgtat ataatgagtg gttacaaatg gaatcatttt 123 60 

atatgttact tggtagccca ccactcccta aagggactct ataggtaaat actacttctg 124 20 

caccttatga ttgatccatt ttgcaaattc aaatttctcc aggtataatt tacactagaa 124 80 

gagatagaaa aatgagactg accaggaaat ggataggtga ctttgcctgt ttctcacaga 12540 

gcctgctgtc tcctgtggct tttgggtttg gctgtgagta ctttgccctt tttgaggagc 12600 

agggcattgg agtgcagtgg gacaacctgt ttgagagtcc tgtggaggaa gatggcttca 12 660 

atctcaccac ttcggtctcc atgatgctgt ttgacacctt cctctatggg gtgatgacct 1272 0 

ggtacattga ggctgtcttt ccaggtacac tgctttgggc atctgtttgg aaaatatgac 127 8 0 

ttctagctga tgtcctttct ttgtgctaga atctctgcag tgcatgggct tccctgggaa 12 840 

gtggtttggg ctatagatct atagtaaaca gatagtccaa ggacaggcag ctgatgctga 12 900 

aagtacaatt gtcactactt gtacagcact tgtttcttga aaactgtgtg ccaggcagca 12960 

tgcaaaatgt tttatacaca ttgcttcatt taattctcac aaggctactc tgaagtagtt 13020 

actataataa ccagcaattt tcaaatgaga gaactgtgac tcaaagacgt taagtaacca 13 080 

gctttggtca cacaactgtt aaatgttggt acgtggaggt gaatccactt cggttacact 13140 

gggtcaataa gcccaggcga atcctcccaa tgctcaccca attctgtatt tctgtgtcct 132 00 

cagagggggt acaactagga gaggttctgt ttcctgagta caggttgtta ataattaaac 132 60 

atactagctc taaggcctgc ctgtgattta attagcattc aataaaaatt catgttgaat 1332 0 

ttttctttag tacttctttc ttaatataat acatcttctt gaccaagtcc aagaggaacc 13360 

tgcgttggac agttttcata tgagatcaaa ttctgagaga gcaagattta accctttttg 1344 0 

gttcaccttc tgatcctccc ctaaggaggt atacatgaaa tatttattac tcctgcctga 13500 

acttctttca ttgaatatgc aattttgcag catgcagatt ctggatttaa attctgagtc 13560 

ttaacttact ggctgaggga ccttggatag gctccttatc cctcagtttc ctcatctcta 13620 

aaatggggat ggcacctgcc ccgtgggttg ttggaaggac ttacagaggt gcagaatgta 1368 0 

cgttgtacat agcaggtttc agcaaatgtfc agctccctct ttccccacat ccattcaaat 13740 

ctgttccttc tccaaaggat gtgtcaagga ggaaatggac ctggctggga aaccctcaga 13 80 0 

atactgggat gatgctgagc ttggctcata cctgtgcttt gctttcaggc cagtacggaa 1386 0 

ttcccaggcc ctggtatttt ccttgcacca agtcctactg gtttggcgag gaaagtgatg 13 92 0 

agaagagcca ccctggttcc aaccagaaga gaatgtcaga aagtaagtgc tgttgacctc 13980 

ctgctctttc tttaacctag tgctgctgcc tctgctaact gttgggggca agcgatgtct 14 040 

cctgcctttc taaaagactg tgaaaccact ccaggggcag agaaatcaca tgcagtgtcc 14100 

ctttccaaat cctcccatgc catttatgtc caatgctgtt gacctattgg gagttcacgg 14160 

tGtcgatccc tgagggacat tttctttgtt gtcttggctt ctagaagagt atcttttact 14220 

tgccccctcc caaacacaca tttcatggtc tcctaacaag ctagaagaaa gaggtaaaga 1428 0 

caagcgtgat tgtggaacca tagcctcgct gcctgcctgt gacatggtga cctgtgtatc 14340 

agcctgtgtg ggctgagacc aagtggctac cacagagctc agcctatgct tcataatgta 14400 

atcattaccc agatccctaa tcctctcttg gctcttaact gcagacagag atgtccacag 14460 

cfccatcaaag gctctgcttc tgggttcttt gtgcttagag tggcttccta aatatttaat 1452 0 

aggtcccttt tctgccagtc tcttctgtgc ccatcccctg attgcccttg gtaaaagtat 14580 

gatgcccctt agtgtagcac gcttgcctgc tgttcctaat catcttctcc tacctcctct 14 64 0 

ttacacctag ctcctgtttc agtcacctag aaatgctcac agtcgctgga atatgtcatg 14700 

ttcttccaca cctccatgcc tttgtaggta ctgtttgctc tcacaggaga actttctctc 14760 

taacttgcct atcttctcaa ctcctccttt: ctctccaaga tctagttccg gaGcccctcc 14Q20 

cctgagcatc cctccttggt tctcaggtag tcagtcactc tctgccctga acttccatgg 14 880 

cacgtgaaag aaaatctttt tattttaaaa caattacaga ctcacaagaa gtaatacaaa 14 94 0 
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ttacatgagg gggttccctt aaacctttca tccagtttcc ccaatggtag cagcatgtgt 15000 

aactgtagaa tagtatcaaa accatgaaat tgacataggt acaattcaca aaccttcttc 15060 

agatttcact agctttatgt gcgctcattt gtgtgtgtgt gtgcgtattrt agttctatgc 15120 

aattttatca tgtgtgaatt catgtaatta ctagctcagt caagctgcag aaatatctca 15180 

ttgtcacaaa gctccttcat gctacccctt aatggccaca gccacctccc ttcttcctca 15240 

gttcctgaca cctgtcaacc actaatgcgt tcctcgtttt tacagtttta ttatttctag 15300 

aatgttacat aaatggaacc atacagtagg tatccttttg atactggctt tttttttttt 15360 

ttcactcagc agtattccct tagatctatc caagttgtgt gtgtcaacag ttcattcctc 1542 0 

ttcactgctg agtagtgttc cctgggaggg gtgtatcaca gttccatggc atttttagat 15480 

gtatttttta aacagctttc agcatcctct attttaattg ttcatcaagt cctttttccc . 15540 

aatagactct gaatgctcct ttatcatcgt attcccatca ccaacatcag tacccaaata 15600 

ggcGCtaaat aaacatttat agcctcctgc ctgcctgaga aaccagggtg gacatggaga 15660 

gaaggcactt ctgaaagttc aagcgcagtg csctgtgtcc ttacactcca ctcctcagtg 15720 

ctttctgtgg gttcatttct gtcttctctc ctgtcacagt ctgcatggag gaggaaccca 15780 

cccacttgaa gctgggcgtg tccattcaga acctggtaaa agtctaccga gatgggatga 15840 

aggtggctgt cgatggcctg gcactgaatt tttatgaggg ccagatcacc tccttcctgg 1S900 

gccacaatgg agcggggaag acgaccacca tgtaagaaga gggtgtggtt cccgcagaat 15960 

cagccacagg agggttctgc agtagagtta gaaatttata ccttaggaaa ccatgctgat 16020 

ccctgggcca agggaaggag cacatgagga gttgccgaat gtgaacatgt tatctaatca 16080 

tgagtgtctt tccacgtgct agtttgctag atgttatttc ttcagcctaa aacaagctgg 16140 

ggcctcagat gacctttccc atgtagttca cagaattcfcg cagtggtctt ggaacctgca 16200 

gccacgaaaa gatagafctac atatgttgga gggagttggt aattcccagg aactctgtct 16260 

ctaagcagat gtgagaagca cctgtgagac gcaatcaagc tgggcagctg gcttgattgc 16320 

cttccctgcg acctcaagga ccttacagtg ggtagtatca ggaggggcca ggggctgtaa 16380 

agcaccagcg ttagcctcag tggcttccag cacgattcct caaccattct aaccattcca 16440 

aagggtatat ctttgggggg tgacattctt ttcctgtttt ctttttaatc tttttttaaa 16500 
acatagaatt aatatattat gagcttttca gaagattttt aaaaggcagt cagaaatcct „ 16560 

actacctaac acaaaaattg tttttatctt tgaataatat gfctcttgttt gtccattttc 16620 

catgcatgcg atgttaggca tacaaaatac attttttaaa gaatactttc attgcaaatt 16680 

ggaaacttcg tttaaaaaat gctcatacta aaattggcat ttctaaccca taggcccact 16740 

tgtagttatt taccgaagca aaaggacagc tttgctttgt gtgggtctgg tagggttcat 16800 

tagaaaggaa tgggggcggt gggagggttg gtgttctgtt ctctctgcag actgaatgga 168 60 

gcatctagag ttaagggtag gtcaaccctg acttctgtac ttctaaattt ttgtcctcag 16920 

gtcaatcctg accgggttgt tccccccgac ctcgggcacc gcctacatcc tgggaaaaga 16980 

cattcgctct gagatgagca ccatccggca gaacctgggg gtctgtcccc agcataacgt 170 4 0 

gctgtttgac atgtgagtac cagcagcacg ttaagaatag gccttttctg gatgtgtgtg 1710 0 

tgtcatgcca tcatgggagg agtgggactt aagcatttta ctttgctgtg tttttgtttt 17160 

ttcttttttt cttttttatt tttttgagat ggagtctcgc tcfcgtagcca ggctggactg 1722 0 

tagtggcgcg atctcggctc actgcaacct tggcctccca ggttcaagcg attctcctgc 17280 

C3tcagcctcc cgagtagctg ggactctagg cacacaccac catgcccagc taatttttgt 1734 0 

gtttttagta gagacggggt ttcaccatgt tggccaggat ggtctcaatg tcttgacctc 1740 0 

gtgatccgcG cacctcggtc tcccaaagtg ctgggaacac aggcatgagc cactgtgtct 174 60 

ggccacattt tactttcttt: gaatatggca ggctcacctc cgtgaacacc ttgagaccta 17520 

gttgttcttt gattttagga gaagtgggag gtgaatggtt gagctgtaga ggtgacatca 175 BO 

gcccagccag tggatggggg cttgggaaac attgcttccc attattgtca tgctggaggg 17640 

ccctttagcc catcctctcc ccccgccacc ctccttattg aggcctggag cagacttccc 17700 

agacctggta gtgcttcagg gccctggtat gatggaccta tatttgctgc ttaagacatt 17760 

tgctcccact caggttgtcc catcagccat aaggccccca gggagcccgt gtgatggagc 1782 0 

agagagagac ctgagctctg caatcttggg caaggctfctt cccttatgtt tcttcttatc 17880 

taaagtgaac agctggggct catgtgctcc ctcctcatct aaagtgaaca catggggctc 1794 0 

atgtgcaggg tcctccccgc tttcagagcc tgaggtcccc tgaggctcag gaagg.ctgct 18000 

ccaggtgagt gccgagctga cttcttggtg gacgtgctgt ggggacagcc cattaaagac 18060 

cacatcttgg ggcjcctgaaa ttgaaagttg taactgcctg gtgcatggtg gccaggcctg 1812 0 

ctggaaacag gttggaagcg atctgtcacc tttcactttg atttcctgag cagctcatgt 1818 0 

ggttgctcac tgttgttcta ccttgaatct tgaagattat ttttcagaaa ttgataaagt 1824 0 
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tattttaaaa agcacgggga gagaaaaata tgcccattct catctgttct gggccagggg 18300 

acactgtatt ctggggtatc cagtagggcc cagagctgac ctgcctccct gtccccaggc 18360 

tgactgtcga agaacacatc tggttctatg cccgcttgaa agggctctct gagaagcacg 18420 

tgaaggcgga gatggagcag atggccctgg atgttggttt gccatcaagc aagctgaaaa 18480 

gcaaaacaag ccagctgtca ggtgcggccc agagctacct tccctatccc tctcccctcc 18540 

tcctccggct acacacatgc ggaggaaaat cagcactgcc ccagggtccc aggctgggtg 1860 0 

cggttggtaa cagaaacttg tccctggctg tgcccctagg tcctctgcct tcactcactg 18660 

tctggggctg gtcctggagt ttgtcttgct ctgttttttt gtaggtggaa tgcagagaaa 1872 0 

gctatctgtg gccttggcct ttgtcggggg atctaaggtt gfecattctgg atgaacccac 1878 0 

agctggtgtg gacccttact cccgcagggg aatatgggag ctgctgctga aataccgaca 18840 

aggtgcctga tgtgtattta ttctgagtaa atggactgag agagagcggg gggcttttga 18900 

gaagtgtggc tgtatctcat ggctaggctt ctgtgaagcc atgggatact cttctgttak 18960 

cacagaagag ataaagggca ttgagactga gattcctgag aggagatgct gtgtctttafc 19020 

tcatcttttt gtccccaaca tggtgcacta aatttatggt tagttgaaag ggtggatgct 19080 

taaatgaatg gaagcggaga ggggcaggaa gacgattggg ctctctggtt agagatctga 19'3 40 

tgtggtacag tatgaggagc acaggcaggc ttggagccaa ctctggcttg gccctgagac 19200 

attgggaaag tcacaacttg cctcaccttc tttgccgata ataatagtgg tgcgttaccc 19260 

catagaggat taaattaaat gagaatgcac acaaaccacc tagcacaatg cctggcatat 1932 0 

agcaagttcc caaataaaat gcgtactgtt cttacctctg tgaggatgtg gtacctatat 193 80 

atacaaagct ttgccattct aggggtcata gccatacagg gtgaaaggtg gcttccaggt 19440 

ctcttccagt gcttacccct gctaatatct ctetagtccc tgtcactgtg acaaatcaga 1950 0 

actgagaggc ctcacctgtc ccacatcctt gtgtttgtgc ctggcaggcc gcaccattat 19S60 

tctctctaca caccacatgg atgaagcgga cgtcctgggg gacaggattg ccatcatctc 19620 

ccatgggaag ctgtgctgtg tgggctcctc cctgtttctg aagaaccagc tgggaacagg 1968 0 

ctactacctg accttggtca agaaagatgt ggaatcctcc ctcagttcct gcagaaacag 1974 0 

tagtagcact gtgtcatacc tgaaaaaggt gagctgcagt cttggagctg ggctggtgtt 19800 

gggtctgggc agccaggact tgctggctgt gaatgatttc tccatctcca ccccttttgc 19860 

catgttgaaa ccaccatctc cctgctctgt tgcccctttg aaatcatatc atacttaagg 19920 

catggaaagc taaggggccc tctgctccca ttgtgctagt tctgttgaat cccgttttcc 199S0 

ttttcctatg aggcacanag agtgatggag aaggtcctta gaggacatta ttatgtcaaa 20040 

gaaaagagac ttgtcaagag gtaagagcct tggctacaaa tgacctggtc gttcctgctc 2 0100 

attacttttc aatctcattg accttaactt ttaaactata aaacagccaa tatttattag 2 0160 

gcactgattt catgccagag acactctggg cattgaaaga aagtaatgat aatagttaat 20220 

tttatatagc gttgttacca tttcaacctt tttttttttt: taacctctat catctcaatt 2 02 8C 

aaag 202 84 



<210> 22 

<211> 7052 

<212> DNA 

<213> Homo sapiens 

<400> 22 

gtgaacacac attaaagcat gagaagcatg aactagacat gtagccaggt aaaggccttg 60 

ctgagatggt tggcaaaggc ctcattgcag cattcattgg caggccacag ttctcttggc 120 

agctctgctt cctgaccttt caccctcagg aagcgaggct gttcacacgg cacacacatg 180 

ccagacaggg tcctctgaag ccacggctgc cagtgcatgt gtcccaggga aagctttttc 24 0 

ctttagttct cacacaacag agcttcttgg aagccctccc cggcgaaggt gctggtggct 3 00 

ctgccttgct ccgtccctga cccgttctca cctccttctt tgccatcagg aggacagtgt 360 

ttctcagagc agttctgatg ctggcctggg cagcgaccat gagagtgaca cgctgaccat 42 0 

cggtaaggac tctggggttt cttattcagg tggtgcctga gcttccccca gctgggcaga 4 80 

gtggaggcag aggaggagag gtgcagaggc tggtggcgct gactcaaggt ttgctgctgg 540 

gctggggctg ggtggctgcg ggggtgggag cagcttggtg gcgggttggc ctaatgcttg 600 

ctggggtgcc tggggctcgg tttgggagct agcagggcag tgtcccagag agctgagatg 660 
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attggggttt ggggaatccc ttaggggagt ggacactgaa taccagggat gaggagctga 720 

gggccaagcc aggagggtgg gatttgagct tagtacataa gaagagtgag agcccaggag 780 

atgaggaaca gccttccaga tttttcttgg gtagcgtgtg taggaggcca gtgtcaccag 840 

tagcatatgt ggaacagaag tcttgaccct tgctatctct gcctagtcct aatggctggc 90 0 

ttttcccagg aaggcttctg cttccatgga ctgttagafct aaccctttat ttaggtaaat 960 

gagggaacct actttataag cataggaaag ggtgaagaat cttttaagat tcctttactc 1020 

aagttttctt ttgaagaatc ccagagctta ggcaatagac accagacttfc gagcctcagt 1080 

tatccattca cccatccacc cacccaccca cccatccttc catcctccca tcctcccatt 1140 

cacccatcca cccatccagc tgtccaccca ttctacactg agtacctata atgtgcctgg 1200 

ctttggtgat acaaaggtga ataagacata gtcctttcct ttgcccccaa ccctcagacc 1260 

agagatgaac atgtggaatg acctaaacac ctggaacagg tgtggtgtat gagcggcagg 1320 

cctctgatga gagggtgggg gatggccagc cctcactccg aagcccctct gagttgattg 13 8 0 

agccatcttt gcattctggt cctgcagatg tctctgctat ctccaacctc atcaggaagc 1440 

atgtgtctga agcccggctg gtggaagaca tagggcatga gctgacctat gtgctgccat 1500 

atgaagctgc taaggaggga gcctttgtgg aactctttca tgagattgat gaccggctct 1560 

cagacctggg catttctagt tatggcatct cagagacgac cctggaagaa gtaagttaag 162 0 

tggctgactg tcggaatata tagcaaggcc aaatgtccta aggccagacc agtagcctgc 168 0 

attgggagca ggattatcat ggagttagtc attgagtttt taggtcatcg acatctgatt 1740 

aatgttggcc ccagtgagcc atttaagatg gtagtgggag atagcaggaa agaagtgttt 1800 

tcctctgtac cacagtacat gcctgagatfc tgtgtgttga aaccagtggt acctaacaca 1860 

tttacatccc aaccttaaac tcctatgcac ttatttaccc tttaatgagc ctctttactt 192 0 

aagtacagtg fcgaggaacag cggcatcagg atcacttggg aacttgttag aaattcagca 198 0 

acttgggccc agctcagacc tactgaatca gaatcaggag caattctctg gtgtgactgt 2040 

gtcacagcca ggtatcaact ggattctcat acataggaaa tgacaaacgt ttatggatgg 2100 

atagtctact tgtgccaggt gctgagattt gttttttgtt ttttgatttt tttttaatca 2160 

ctgtgacctc atttaattct caaaaaaaga tgaaaaaatg aacactcagg aatgctgaca 222 0 

tgagattcag aatcaggggt ttggggcttc aaagtccatc ctctctttat ccatgtaatg 2280 

cctcccctta gagatacaac atcacagacc ttgaaggctg aaggggatat aaaagctgtc 2340 

tggccaagtg gtctccaagc ttgacagtgc agcjagaatca cctggggata ttattaaaaa 2400 

taaacatact aaggtttggc ttcagggcct gtgaatcaga atttctggag gtgaggcctt 2460 

gaagtctgta tttctattgc atactttgga cacagtggtc tatagactag agtttggaaa 252 0 

tgattgcgct cattcagatt ctcttctgat gtttgaattg ctgccatcat atttctagtg 25 80 

ctctatttcG tcctgctcat tctgtcttgg ataacttatc atagtactag cctactcaaa 2 64 0 

gatttagagc cacagtcctg aaagaagcca cttgactcat tccctgtagg ttcagaataa 2700 

atttcttctg cgcagtgtct gtcatagctt tttttaaatt tttttttatt tttgatgaga 2760 

ctggagtttt gctcttattg cccaagctgg agtgcagtgg tgcgattttg gctcactgca 2S20 

acctccacct cccaggttca agcgattctc ctgcctcagc ctcccaagta gctgagatta 2880 

caagcatgtg ctaceacgcc cagctaattt tgtattttta gtagagatgg gttttatcca 2 940 

tgttggtcag gctggtctcg agctccagac ctcaggtgat ctgcccgcct cggcctccca 3 000 

aagtgctggg attataggcc tgagccacag cgctcagcca taactttaat ttgaaaatga 3060 

ttgtctagct tgatagctct caccactgag gaaatgttct ctggcaaaaa cggcttctct 312 0 

cccaggtaac tctgagaaag tgttattaag aaatgtggct tctactttct ctgtcttacg 3180 

gggctaacat gccactcagt aatataataa tcgtggcagt ggtgactact ctcgtaatgt 3240 

tggtgcttat aatgttctca tctctctcat tttccagata ttcctcaagg tggccgaaga 33 OO 

gagtggggtg gatgctgaga cctcaggtaa ctgccttgag ggagaatggc acacttaaga 33 60 

tagtgccttc tgctggcttt ctcagtgcac gagtattgtt cctttccctt tgaattgttc 342 0 

tattgcattc tcatttgtag agtgtaggtt tgttgcagat ggggaaggtt tgttttgttg 34 80 

taaataaaat aaagtatggg attctttcct tgtgccttca gatggtacct tgccagcaag 354 0 

acgaaacagg cgggccttcg gggacaagca gagctgtctt cgcccgttca ctgaagatga 3600 

tgctgctgat ccaaatgatt ctgacataga cccaggtctg ttagggcaag atcaaacagt 3660 

gtcctactgt ttgaatgtga aattctctct catgctctca cctgttttct ttggatggcc 3 72 0 

tttagccaag gtgatagatc cctacagagt ccaaagagaa gtgaggaaat ggtaaaagcc 378 0 

acttgttctt tgcagcatcg tgcatgtgat caaacctgaa agagcctatc catatcactt 3 84 0 

cctttaaaga cataaagatg gtgcctcaat cctctgaacc catgtattta ttatcttttc 3 900 

tgcggggtcc Cagtttcttg tatacattag gtgtttaatt gttgaacaaa tattcattcg 3 960 



wo 00/55318 



PCT/IBOO/00532 



30 

agtagatgag tgattttgaa agagtcagaa aggggaattt gctgttagag ttaattgtac 4020 

cctaagactt agatatttga ggctgggcat ggtggctcat gccagtaatc ccagcgcttt 4080 

gagaggctga ggtgggtaga tcaactgagg tcaggagttt gagaccagtc tgaccaacaa 4140 

ggtgaaaccc cgtctctact aaatacaaaa aattagccga gtgtggtggc acatgcctgt 4200 

catcccagct acttgggagg ctgaggcagg agaatcgctt gaacccagga ggcagaggtt 4260 

gcagtcagcc acggttgcgc cattgcactc cagactgggc aacaagagtg aaaactccat 4320 

ctcaaaaaag aaaaaaaaag aattagatat tttggatgag tgtgtctttg tgtgtttaac 4380 

tgagatggag aggagagcta agacatcaaa caaatattgt taagatgtaa aagcacatca 444 0 

gttaggtatc attagtttag gacaaggatt tctagaaaat ttttaggaac agaaaacttt 4S00 

ccagttctct cacccctgct caaagagtgt atggctctta cattatatat aactgcctga 4560 

cttcatacag tatcagtact tagatcattt gaaatgtgtc cacgttttac caaaatataa 4S20 

tagggtgaga agctgagatg ctaattgcca ttgtgtattc tcaaatatgt caagctacgt 4680 

acatggcctg tttcatagag tagtctataa gaaattgatg acttgattca tccgaatggc 4740 

tggctgtaac acctggttac gcatgaacac ctcttttcag ttgtctcaag acacctttct 4800 

tttctgtact tatcagacaa ggactgaaag gcagagactg ctactgttag acattttgag 4860 

tcaagctttt ccttggacat agctttgtca tgaaagccct ttacttctga gaaacttcta 4920 

gcttcagaca catgccttca agatagttgt tgaagacacc agaagaagga gcatggcaat 4980 

gccgaaaaca cctaagataa taggtgacct tcagtgttgg cttcttgcag aatccagaga 5040 

gacagacttg ctcagtggga tggatggcaa agggtcctac caggtgaaag gctggaaact 5100 

tacacagcaa cagtttgtgg cccttttgtg gaagagactg ctaattgcca gacggagtcg 5160 

gaaaggattt tttgctcagg tgagacgtgc tgttttcgcc agagactctg gcttcatggg 522 0 

tgggctgcag gctctgtgac cagtgaaggc aggatagcat cctggtcaag atatggatgc 52 8 0 

cggagccaga tttatctgta tttcaatccc agttctattc cttgccagtt gtgtatccgc 5340 

tggcaagtta cttctctatg cctcaatctc ctcatctgta aaatggggat aataatatta 54 0 0 

cctgcaatac agggttgtta cgaaaataaa aatgaatagg tgcttagaat ggggcctgac 54 60 

attagtaagt gcttagtttt gtgtgtgtat atgttatttt tattttggag gagaacataa 552 0 

aaaggaca.aa gtgtagaaaa actggttggg tgtattcagc tgtcataaca tgagagttgt 558 0 

tatgcccaga tgcacttgac atgtgaattt attagaaaca tgatttttct ctgagttgat 564 0 

gtttaactca aactgataga aaagataggt cagaatatag ttggccaaca gagaagactt 570 0 

gttagactat tgtctgcatg tcagtgtttg catgctaact tgcttagtta gaaaggttaa 57 60 

attttttcac tcfcataaaat caagaaatat agagaaaagg tctgcagaga gtctttcatt 5820 

tgatgatgtg gatattgtta agagcgggag tttggagcat acagagctca agttgaatcc 5880 

tgactttgct acttattggc tatatgacct tgggcaagct gcttagtctc tctgatcctc 594 0 

agttaccttt gtttgttgat gatgaccatt gataacacaa ccataaataa fcgacaacata 600 0 

gagatagttc tcattatagt agttgttata cagaattatt cactcaatgt taattttctg 606 0 

cattgaaatc ccagaacatt agaattgggg gcattatttg aatctttaag gttataagga 612 0 

atacatttct cagcaataaa tggaaggagt tttgggttaa cttataaagfc atacccaagt 6180 

catttttttt cagagaagat atggtagaaa gtcttaggag gttgaagaag gaattggata 6240 

tttattcttt ctgagactat catgggagat aatgactatg gttgtccatg attggagccg 630 0 

ttgctgtaga gttggtttta ttatagtgta ggatttgaat gggccatgtg ttctcagacc 6360 

tcagaataaa aagagaaaac tgaggccagt ggggagcgtg acttcacatg ggtacacttg 642 0 

tgctagagac agaaccagga ttcaggactt ctggctcctg gtcctgggtt catggcccaa 6480 

tgtagtcttt ctcagtcttc aggaggagga agggcaggac ccagtgttct gagtcaccct 654 0 

gaatgtgagc actatttact tcgtgaactt cttggcttag tgcctctgcc aggtggccat 6600 

aacctctggG cttgtgttgc cagagaaaag gtttagttfct caggctccat tgcttcccag 666 0 

ctgccaagaa tgccttggtg cagcacagtc ataggccctg cattcctcat tgccgtgctg 6720 

gttggtcggg gaggtgggct ggactcgtag ggatttgccc cttggccttg tttctaacac 6780 

ttgccgtttc ctgctgtccc cctgccccct ccactgcctg ggtaaagatt gtcttgccag 6 84 0 

ctgtgttfcgfc ctgcattgcc cttgtgttca gcctgatcgt gccacccttt ggcaagtacc 6900 

ccagcctgga acttcagccc tggatgtaca acgaacagta cacatttgtc aggtatgttt 6 96 0 

gtcttctaca tcccaggagg gggtaagatt cgagcagacc aaagatgttt acgagggcca 7020 

agggaatgga cttcagaatt acacggtgga at 7 052 
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<2Z0> 23 

<211> 2S34 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gggaagcatt taaaaaaaaa aaagtatata tatatatata tatatatata tgtaatgtga 60 

attggcctct ttttetctaa gcccacattt tcttcttaca tagttcaggt ttactttatt 120 

ttttcctttc cggccgctga ccctgtattg cccgtagttg tggaacatag catgtgtttg X80 

tgacctgtgc ctgttatttt tgtgctttct agttgtgcat gcaaagagta caaagttttc 240 

ttgccctttc ttggaaaatc ctgcttgtct gtgccaaagg gataattgtg aaagcacttt 300 

tgaaat.actt: aatgagttga ttttcttcaa attiaaaaaaa atatataaat gtatatgtgt 360 

atgtacatgt gtgcacacat acacaccttit atacatacag cccatttaaa acaagctcca 420 

ctttggagtg ctctacgtca ccctgafcgcc gaatacaggg ccagagtctg agatccttct 480 

gggtggtttc tgfcgttttigt tcatttctgt tttaagagcc tgtcacagag aaatgcttcc 540 

taaaatgttt aatttataaa aaaattttta tctctcgatt actggtttta atgaattact 600 

aagctggctg cctctcatgt acccacagca atgatgctcc tgaggaeacg ggaacectgg 660 

aactcttaaa cgccctcacc aaagacectg gcttcgggac ccgctgtatg gaaggaaacc 720 

caatcccgtg agtgccactt tagceataag cagggcttct tgtgcttgtt gcctggtttg 780 

atttctaata tgctgcattt ataaactgca tgccacattg tgaccgccag catttgccct 840 

ttgaattatt attatgtttt atttacaaaa agcgaaggta gtaaccgaac taaattatct 900 

aggaacaaac gtttggagag tcttctaaca ccgyscaaag cacgtcatta cagacatttg 960 

tttactgatt tagaacctta atattfcaatt taaatacgca ctttacactt actgatgaaa 1020 

tgcttttcct ttctttctct cccagcccct gtacttaagt gcttcaafcag gctctcatta 1080 

tatatgattt tiraggttteg cttatcagct tcttcgcttt tataatccga aaagatggca 1140 

tatgaatttt tataaaaagg gacactttct tcttctcaaa ttgtatattt ttattgtact 1200 

ttccttcaaa accccctttt aaaaagtaag cagtggataa ataaattcag tgaagcatcc 1260 

atatgaccct taagtgagtg taggggaagg gaggtcacca gatcactgtg agtgaagatg 1320 

gtggagaggt gaggatctta tgaggccgtg ctcaaggctg gtagaggcgg gctagtgttt 1380 

ccaggtttag gcagaatctc agctgaggtc atgaaacaac agtgatctct gaaaaattat 1440 

ggcaaggtgg gaaggtgctg gagaattgga gagggggcaa acttgacttt caagtttcaa 1500 

tgggaagata ggtgactctg cacaccacag aacagtgagc atgataacct gtttatacaa 1560 

ggttctagag cagatttcta aatggatagc tactgtgtgc ttgtttgttc ttaattagta 1620 

ttggatagtt act aaa tact tgttagtact tagtacataa tgggtggtaa atcctagcag 16 SO 

ctaatattgg ttcccaaata accagatgac aaggatagag aaggacacag acacggccta 1740 

tctggatttc atggtgcctt tgattttcca catgaaggtt gtgtagggaa gatagaagca 1800 

tgagatgaga tgataatata gttatctgga ttcatcactg gccagctgaa ccatatgaac 1860 

tcatggattg atgcfcagctt aggaaggctc tgtaggagcc agaactgggc tgagagccag 1920 

cGcatagaga caaaagaggc ccggccctga catcagaggg ttcaaacatg atgtctgagc 1980 

Gccacctaca gtctgccgga ggtggttgga aggaagagcc tttatcctta caattcttac 2040 

tgaaattcaa atttttaggt tttgcaaaaa aatggtggac ctgaaggaaa tttgacagga 2100 

gcatgtctca gctgtattta aatttgtctc agccaatccc cttttgaatg ttcagagtgt 2160 

aagcctcagg agggcagcgc gtcttagfcgt gacttttctg gtcagttcag gtgctttaag 2220 

gagacaatta gagatcaatc tggaaaactt catttgaatt tttaatacat aagaaaacaa 2280 

taagaaatag ttaaaaatat atatttatat aatatatata tgtgtgtgtg tgtgtgtgtg 2340 

tgtgtgtgtg tatatatata tatattttat ttatttattt ttttttgaga tggagtctcg 240 0 

ctctgttgcc caggctggag tgcagtggct caatcttggc tcactgccac ctctgcctcc 2460 

caggttcaag tgattctcct acctcagcct cctgagtagc tgggattaca agcatgtgcc 2520 
accacactgg ctaa , 2534 

<210> 24 

<211> 2341 

<212> DNA 

<2 13 > Homo sapiens 
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<400> 24 

tcttgccagt ctctactcat ttttcagcac atcgagcata agatccagac tctttcccag 60 

gcctctctca tctggctcct ctcctcctcc tttatcatta ctcttcttcg tagcttatcc 120 

tactccagcc atgctgtctt cctattatrtc ctaaaaarta gaaatgcatt tcttcctagg 180 

gcctttgtac ctgcacttgc catcgctttt gdtcagaatg ttctttttgc caagcttttg 240 

cccagcttgt tctccatcat tgttatgttt tggctgaaat gtcttctctt agtaggfctca 300 

ttctccccag tcactgtctt tttattttgc tttattttgg gccatctaag gttatcttat 360 

tagtgtattt gttgttcgtc tcctccatgg gcatacacct ccatgaaggc aggtattttc 420 

aecfctaggcG ctcgaatata ctggacagca tctggeacgt agtagatgct caacgaatgt 480 

ttgttgtgtg agcaaatggt tggttgattg gattgaactg agttcagtat gtaaatattt: 540 

agggcctctt tgcattctat tttacttatg tataaaatga tacataatga tgatataaat 600 

gatgtcacag tgtacaaggc tgttgtggga tcaagcaatc aaatgagatc atgcttgtct 660 

tttccaaatg gtgagggaat agatgcatgt ttgtggttgt tacggaatga tcctgtgctc 720 

ctgaggcaac agaaaggcca ggccatctct ggtaatccta ctcttgctgt cttccctttg 780 

cagagacacg ccctgccagg caggggagga agagtggacc actgccccag ttccccagac 840 

catcatggac ctcttccaga atgggaactg gacaatgcag aacccttcac ctgcatgcca 300 

gtgtagcagc gacaaaatca agaagatgct gcctgtgtgt cccccagggg caggggggct 960 

gcctcctcca caagtgagtc actttcaggg ggtgattggg cagaaggggt gcaggatggg 102 o 

ctggtagctt ccgcttggaa gcaggaatga gtgagatatc atgttgggag ggtctgtttc 1080 

agtcttttfct gttttttgtt tttttttctg aggcggagtc ttgctctgtc gcccaggctg 1140 

gagtgctgtg gcatgatctt gcctcactgc aacctccacc tcccaggttc aagcgattct 1200 

cctgcctcag cctcctgagt agctgggatt acaggcacgc accaceatgt ctggctaatt 1260 

tttgtgtttt tagtagagat agggtttcgc cgtgttggct aggctggtct ggaattcctg 1320 

acctcaggtg atccacccgc eteggcctcc caaagtgctg ggattacagg cgtgagccac 13 80 

tacgcccagc cetgttitcag tctttaactc gcttcttgtc ataagaaaaa gcatgtgagt 1440 

tttgagggga gaaggtttgg accacact^t gcccatgcct gtcccacagc agtaaagtca 1500 

caggacagac tgtggcaggc ctggcttcca atcttggctc tgcaaaaaat gage tgg tag 1560 

cctttgacag gcctgggcct gtttcttcac ctctgaatta gggaggctgg accagaaaac 1620 

ecctgtggat cttgtcaact ctggtattct tagagactct gtttgggaag gagtcctgag 1680 

ceattttttt tttcttgaga afcttcaggaa gaggagtgct tatgatagct ctctgctgct 1740 

tttatcagca accaaattgc aggatgagga caagcaattc taaatgagta caggaactaa 1800 

aagaaggctt ggttaccact ctlcgaaaata atagctagtc caggtgcggg gtggctcaca 1860 

cctgtaatct cagtattttg ggatgccgag gtggactgat cacctaaggt caggagttcg 1320 

aaaccagctt: ggccaatgtg gcgaaaccct gtctctacta aaaattcaaa aattagccag 1980 

gcatggtggc acafcgcctgt aatcccagtt acttgggagg ctgaagcagg agaattgctt 2 04 0 

gaacctggga ggtggaggtc gcagggagcc aaaattgcgc cactgtactc cagcctgagc 2100 

aacacagcaa aactrccatat caaaaaataa aatgaataaa ataacagcta atct:agtcat 2160 

cagtataact ccagtgaaca gaagatttat taggcatagt gaatgatggt gcttcctaaa 2220 

aatctcttga ctacaaagaa tctcatttca atgtttattg tttagatgtt cagaataaat 22 BO 

tcttgggaaa gaccttgget tggtgtaagt gaattaccag tgccgagggcs agggtgaacc 2340 

aagtctcagt gctggttgac tgagggcagt gtctgggacc tgtagtcagg tttccggtca 2400 

cactgtggac atggtcactg ttgtccttga tttgttttct gtttcaattc ttgtctataa 2460 

agacccgtat gcttggtttt catgtgatga cagagaaaac aaaacactgc agatatccet 2520 

caggacctga caggaagaaa catttcggat tatctggtga agacgtatgt gcagatcata 25 a o 

gccaaaaggt gactttttac taaacttggc ccctgcctta ttattactaa ttagaggaat 2640 

taaagaccta caaataacag actgaaacag tgggggaaat gccagattat ggcctgattc "2700 

tgtctattgg aagcttagga tattratccca aactagaaaa gatgacgaga gggactgtga 2760 

acattcagtt gtcagcttca aggctgaggc agcctggtct agaatgaaaa tagaaatgga 2 82 0 

ttcaacgtca aattttgcca c 2 841 

<210> 25 

<2ai> 852 

<212> DNA 

<213> Homo sapiens 



wo 00/55318 



PCT/IBOO/00532 



33 

<400> 25 

gcatgcfcgga gtgatagtga ccatgagttt ctaagaaaga agcataattt ctccatatgt 60 

catccacaat tgaaatatta ttgttaattg aaaaagcttc taggccaggc acggtggctc 120 

atgcctgtaa tcccagcact ttaggagcca aggcgggtgg atcacttgag gtcaggagtt 180 

tgagaccagc ctggccaa.ca tggggaaacc ctgtctctac taaaaataca aaataagctg 240 

ggcgtggtgg tgcgtgcctg taatcccagc tacttgggag gctgaggcag gagaactgct 300 

tgaatctggg aggcggaggt tgcagtgagc tgagttcatg ccattgcatt ccagcctggg 3S0 

caacaagagc gaaaccatct: cccaaaagaa aaaaaaaaga aagaaaaagc ttictagtttg 420 

gttacatctt: ggtctataag gtggtttgta aattggttta acccaaggcc tggttctcat 480 

ataagtaata gggtatttat gatggagaga aggctggaag aggcctgaac acaggcttct 540 

tttctctagc acaaccctac aaggccagct gattctaggg ttatttctgt ccgttcctta 600 

tatcctcagg tggatactta ctccttttgc atcattagga ataggctcag tgctttcttt 660 

gaactgattt tttgtttctt tgtctctgca gcttaaagaa caagatctgg gtgaatgagt 72 0 

ttaggtaagt tgctgtcttt ctggcacgtt tagctcaggg ggaggatggt gttgtaggtg 780 

tgcttggatt gaagaaagcc ttggggattg tttgtcactc acacacttgt gggtgccatc 840 

tcactgtgag ga 852 

<210> 26 

<211> 6285 

<212> DNA 

<2X3> Homo sapiens 

<400> 26 

gctttataga gtttctgcct agagcatcat ggctcagtgc ccagcagccc ctccagaggc 60 

ctctgaatat ttgatatact gatttccttg aggagaatca gaaatctcct gcaggtgtct 12 0 

agggatttca agtaagtagt gfctgtgaggg gaatacctac ttgtactttc cccccaaacc 180 

agattcccga ggcttcttaa ggactcaagg acaatttcta ggcatttagc acgggactaa 240 

aaaggtctta gaggaaataa gaagcgccaa aaccatctct ttgcactgta tttcaaccca 300 

tttgtcctfcc tgggttttga aggaacaggt gggactgggg acagaagagt tcttgaagcc 360 

agtttgtcca tcatggaaaa tgagataggt gatgtggcta cgtcaggggg cccgaaggct 420 

ccttgttact gatttccgtc ttttctctct gccttttccc caagggccag g^cccctgga 480 

tctctgggca gagcagacgc aggcccctat aatagccctc atgctagaaa ggagccggag 540 

ccrgtgtata aggccagcgc agcctactct ggacagtgca gggttcccac tctcccaact 600 

ccccatctigc ttgcctccag acccacatitc acacacgagc cactgggttg gaggagcatc 66 0 

cgtgagatga aacaccattc tttcctcaat gtctcagcta tctaactgtg tgtgtaatca 720 

ggccaggtcc tccctgctgg gcagaaacca tgggagttaa gagattgcca acatttatta 780 

gaggaagctg acgtgtaact tctgaggcaa aatttagccc tcctttgaac aggaatttga 840 

ctcagtgaac cttgtacaca ctcgcactga gtctgctgct gatgatactg tgcaccccac 900 

tgtctgggtt ttaatgtcag gctgttcttt taggtatggc ggcttttccc tgggtgtcag 960 

taatactcaa gcacttcctc cgagtcaaga agttaatgat gccatcaaac aaatgaagaa 1020 

acacctaaag ctggccaagg taaaatatct atcgtaagat gtatcagaaa aatgggcatg 1080 

tagctgctgg gatataggag tagttggcag gttaaacgga tcacctggca gctcattgtt 114 0 

ctgaatatgt tggcatacag agccgtcttt ggcatttagc gatttgagcc agacaaaact 12 0 0 

gaattactta gttgtacgtt taaaagtgta ggtcaaaaac aaatccagag gccaggagct 1260 

gtggctcatg cctgtaatcc tagcactttg ggaggctgaa gcgggtggat cacttgaggt 1320 

caggagttcg agaccagcct ggcctacatg acaaaacccc gtatctacta aaaatacaaa 1380 

aaaattagct gggcttggtg gcacacacct gfcaatcccag ctacttggga ggctgaggca 1440 

ggagaattgc ttgaaccctg taggaagagg ttgtagtgag ccaagatcgc accgttgcac 1500 

tccagcctgg gcaacaagag caaaaetcca tctcaaaaaa caaattaaat ccagagattt 1560 

aaaagctctc agaggctggg cgcggtggct tacacctgtt atcccagcat tttgggatgc 162 0 

cgaggcgggc aaagcacaag gtcaggagtt tgagaccagc ctggccaaca tagtgaaacc 1680 

ctgtctctgc taaaaacata gaaaaattag ccgggcatgg tggcgtgcgc ctgtaatccc 174 0 

agctactcgg gaggctgagg tgagagaatt rcttgaaccc gggaggcgga ggttgcagtg 1800 

agcccagatt gcaccactgc actccagcct gggcgacaga gcaagactcc atctcaaaaa 1860 

aagctctcag aacaaccagg tttacaaatt tggtcagttg gtaaataaac tgggtttcaa 192 0 
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acatactttg ctgaaayaat cactgactaa ataggaaatg aatctttttt tttttttttt 1980 

taagctggca agctggtctg taggacctga taagtaetrca cttcatttct ctgtgtctca 2040 

ggtttcccat ttttaggtga gaattaaggg gctctgataa aacagaccct aggattgtgg 2100 

acagcagtga tagtcctaga gtccacaagt ctgcttttga gtgatgggcc catgtatctg 2160 

gcacatctgc aggcagagcg tggttctggc tcttcagatg atgccggtgg agcactttga 2220 

ggagtcctca ccccaccgtg ataaccagac attaaaatct tggggctttg catcccagga 2280 

tttctctgtg attccttcta gacttgtggc atcatggcag catcactgct gtagatttct 2340 

agtcacttgg ttctcaggag ccgtttattt aatggcttca catttaattt cagtgaacaa 2400 

ggtagtggca ttgctcttca cagggccgtc ctgttgtcca caggttccag attgactgtt 2460 

gccccttatc tatgtgaaca gtcacaactg aggcaggttt ctgttgttta caggacagtt 2S20 

ctgcagatcg atttctcaac agcttgggaa gatttatgac aggactggac accagaaata 2580 

atgtcaaggt aaaccgctgt ctttgttcta gtagcttttt gatgaacaat aatccttatg 2640 

tttcctggag tactttcaac tcatggtaaa gttggcaggg gcattcacaa cagaaaagag 2700 

caaactatta actttaccag tgaggcagta cggtgtagtg tagtgattca gagaatttgc 2760 

tttgccacca gacataccag gtaaccttga ctaagttact: taacctatct aaacctcagt 2820 

tycctcatct gtgaaatgga gacagtaatc atagctattt ccaaactgtt gtgagaattc 28 80 

aatgagttaa aggtataagg tcctcaccac agcgcctgcc cacatagtca gtgatcacta 294 0 

tgtcctgaac actgtaatta cttcgccata ttctctgatc atagtgtttt gccttggtat 3 000 

gtgactagaa tttctttctg aggtttatgg gcatggttgg tgggtatgca cctgcctgca 3 060 

ggagcccggt ttgggggcat taccttgtac ctggtatgtt ttctttcagg tgtggttcaa 33 20 

taacaagggc tggcatgcaa tcagctcttt cctgaatgtc atcaacaatg ccattctccg 318 0 

ggccaacctg caaaagggag agaaccctag ccattatgga attactgctt tcaatcatoc 324 0 

cctgaatctc acoaagcagc agctctcaga ggtggctctg taagtgtggc tgtgtctgta 330O 

tagatggagt ggggcaaggg agagggttat ggagaagggg agaaaaatgt gaacctcatt 33 60 

gtaggggaac agctgcagag accgttatat tatgataaat ctggattgat ccaggctctg 3420 

ggcagaagtg ataagtttac gaattggctg gttgggcttc ttgaactgca gaagagaaaa 3480 

tgacactgat atgtaaaaat cgtaacattt: agtgaattca tataaagtga gttcaaaaat 3540 

tgttaattaa attataattt aattataagt gtttaatcag tttgatttgt ttaaaaacca 3600 

ctgttttaaa tttggtggaa tatgttttta ttagcttgta tctttaattc ctaaattaag 3660 

ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gaagtttaaa 3720 

gccaggatga gctagtttaa agtatgcagc ctttggagtc atacagatct gggtttgaat 37 80 

ctggtctcta aactttatag atgtatgata ttaaatgagg cagttcatgt aaattgccaa 3840 

gcccagcact cagcacagag ttgatatttc acacacatta gatacctttc ctgtatgtgg 3900 

agcatggcag ttcctgtttc tgctttactc ctacaggata ctaatafcagg acactaggat 3960 

ctttatacca agaccecatg taatgggctt atgagaccat tcttcttata aaaatctgac 402 0 

agaatttttg tatgtgttag atcaataggc tgcatactgt tattttcaag ttgatttaca 4080 

gccagaaata ttaatttatt tgagtagtta cagagtaata tttctgctct catttagttt 414 0 

tcaagcccca ctagtccttt gtgtgtgaaa atttacaact tactgctett acaaggfccat 420 0 

gaacagtgga ccaaagtgaa tgccattaac cactctgacti tccttcatta gttttattgt 4260 

gacagtggac tcttttgacc tcagtaatac cagtttggca tttacattgt catattttta 4320 

gacttaaaaa tgatcatctt aaccctgaat aaaatgtgtc tggtgaacag atgtttttcc 43 8 0 

ttggctgtgc ctcagatatc tctgtgtgtg tgtacgtgtg tgtttgtctg tgtgtccatg 444 0 

tcctcactga ttgagcccta actgcatcaa agacccctca gattttcaca cgctttttct 4500 

ctccaggatg accacatcag tggatgtcct tgtgtccatc tgtgtcatct ttgcaatgtc 4560 

cttcgtcGca gccagctttg tcgtattcct gatccaggag cgggtcagca aagcaaaaca 462 0 

cctgcagttc atcagtggag tgaagcctgt catctactgg ctctctaatt ttgtctggga 4680 

tatggtaagg acacaggcct gctgtatctt tctgatgtct gtcagggcca tggattgata 474 0 

tggataagaa agaaagagct ctggctatca tcaggaaatg ttccagctac tctaaagatg 4800 

tatgaaaaag aaatagccag aggcaggt:ga tcactttcat gacaccaaac acagcattgg 4860 

gtaccagagt tcatgtcaca ccagagggaa aattctgtac acaatgatga aaattaatac 492 0 

cactaccact taagttccta tgtgacaact ttcccaagaa tcagagagat acaagtcaaa 4980 

actccaagtc aatgcctcta acttctctga tgggttttaa cctccagagt cagaatgttc 504 0 

tttgccttac taggaaagcc atctgtcatt tagaaaactc tgtacatttt atcagcagct SlOO 

tatccatcca ttgcaaatat tgtttttgtg ccasccacaa tatattgctt ctatttggac 516 0 

caatatgggg gatttgaagg aactctgaag ttctaattat atttcaactc tactttacaa 522 0 
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tatictccctg aaatatatct ccctgtaact tctattaatt ataagctaca cagagcaaat 5230 

ctaattcttc t:cccaccgaa caagtccctg gatatttaaa aataactctc atactctcat 5340 

ttaacctgag fcattacccag ataagatgat atatgagaat acaccttgta acctccgaag 5400 

cactgtacaa atgtgagcaa tgatggtgga gatgatgatg agatctttgc tgtttatacc 34 SO 

aagcccctta gactgtgtca ctcttctgat ccggttgtcc ttgtatggcc atgctgtata 5520 

ttgtgaatgt cccgttttca aaagcaaagc caagaattaa ccttgtgttc aggctgtggt 5580 

ctgaatggtt atgggtccag agggagttga tctttagctc acacttctat tactgcagca 5640 

caaagatttt: gcattttgga aggagcaccg tcttactggc aactitagtgg taaaccaaaa 5700 

cetccatttc acacaaatga ttgtgaaatt cgggtctcct tcattctata caaattcatt 5760 

tgattttttt gaaactaaac tttatattta tccatattaa attacatggg ttttattttt 5820 

gttttatcjtt gattcagtaa ttactccttt cagtaaacac agactgagtg ctgtgtgtct 5 880 

gacttatgcG aggcataggt gattcagaga tgaaaggtca agtccctgaa cccatctctt 5940 

gtcttcctgg gtattatctg tecctccctg ctttagagct cctgaaattt gctagaagca 6 COO 

tgtcttcatc taagttgttg ataaacacat caagtaggat tggactgagg cagagccctg 6060 

tagtctgaag ctgcagttct tctagcggct gacaagcccc actatcactt ccctgctggt 6120 

gctttgctct gccagctgtg aattctcata attgtcctat cgtcaagtet ttatttctgc 6180 

attttactgc ttgatacact gtcaggacag actttaaaat tattctcagt gcgatgaaac 6240 

aafctctgaca ttcatgttat gagcagttac ctcataaata gattacatg 6289 



<210> 27 

<211> 4244 

<212> DNA 

<213> Homo sapiens 

<400> 27 

aaattactct gactgggaat ccatcgttca gtaagtttac tgagtgtgac accttggctt 60 

gactgttgga aagacagaaa gggcatgtag tttataaaat cagccaaggg gaaaatgctt 120 

gtcaaaatgt attgtcgggt attfctgatta atagtttatg tggcttcatt aattcagagt 180 

tactctccaa tatgtttatc tgccctttct tgtctgataa tggtgaaaac ttgtgtgatg 240 

cattgtatat ttgatttagg ggtgaactgg atgtctttgt tttcactttt agtgcaatta 3 00 

cgttgtccct gccacactgg tcattatcat cttcatctgc ttccagcaga agtcctatgt: 3 60 

gtcctccacc aatctgcctg tgctagccct tctacttttg ctgtatgggt aagtcacctc 420 

tgagtgaggg agctgcacag tggataaggc atttggtgcc cagtgtcaga aggagggcag 430 

ggactctcag tagacactta tctttttgtg tctcaacagg tggtcaatca cacctctcat 540 

gtacccagcc tcctttgtgt tcaagatccc cagcacagcc tatgtggtgc tcaccagcgt 600 

gaacctcttc attggcatta atggcagcgt ggccaccttt gtgctggagc tgttcaccga 660 

caatgtgagt catgcagaga gaacactcct gctgggatga gcatctctgg gagccagagg 720 

acagtgttta attgtgatct tattccactt gtcagtggta ttgacactgc tgactgcctt 780 

gtcctgtctt: cagagtctgt cttccctgag aaggcaaagc acctttcttt cttgctgtgc 84 0 

cttacatttt gctggtcaag cctttcagtt tcttttgaca gtttttttta cttctttctt 900 

ttttcaatgt tgctcttacc aagagtagct cctctgcctt ccactttaca catgagagct 960 

gggcgacgca ttcagtccta aggcttttac catcacctct cttggtgttt ttattgtcat 1020 

ctctaaganc aatgccttta gccttgatca taaccttgaa ctctaatctc aaattctcac 10 8 0 

ttgcctagtg gattgctcca tttagatagc atatagatac cccaacctgg atatgtccta 114 0 

gttttctttc cccttggaac ttaatgcttt tcttgccatc cctgtcacac tcagtggcac 1200 

taccatccac tcggttgccc aagctggctrc ttagagttat cctagatgct tgctttgctg 1260 

ttgcagattt cccacattca actggttatg ttgtcagttc ttccaggtac ggacctctaa 132 0 

aataaggctt cctctccatt ccggtcgtca ttgcctttgt ccaaacacag cacacaaggc 13 80 

cttttacagt tgcacaactc ttcctgtcca tacecaccac accctttccc agctgtaagc 144 0 

ttcagatgag ttgcctccaa ccaccatgct cctgtaggcc tggcttgaaa tgcccttctt 1500 

ctgtcacagg gtctggtagt atatcccttg cccttcaaga tttagctaaa atgtgaagct 1560 

ttccttacct gctgggaggt gttctctctt ttctctgtgc tctcagagtc cttagtccat 1620 

gcctccagta caacgtacac ccacttacat ggtaatttcc tgtttacata cttttcctac 1680 
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tcggagtgga gtctgtttct taataafcttt gcctctccca tgccctagca cagtgcatcc 174 0 

agcgtatagc cccttafctca gttggtagat atttggccac tgttgccttg tgggatcata 1800 

agttctgatg tatttgagaa gaatttctaa aattctgaca aaatcctgaa actcaaatat i860 

tgacccagac atgagcaattr tgcttttcaa atgctaaggg atttttaatg gatttgcttt 192 0 

aattaaatct agcctgtttc taagctttat tcattatttc tccatactca gagcatttct 1980 

ccagattttc taaagaatiag aattttattg ctacatatca tcagctatgc ctgctgctat 2040 

ttaattggta tctgaattaa aaggtctggt ttgtccctag agaatcaaat tttttcttca 2100 

ctCGcatatt tcagaacttg atacattttt aggataaacc atgaatgaca cccgtttctt 2160 

ctccctcacc ctcccttccc tcccattttt tttttttttt ttttttagaa gctgaataat 2220 

atcaatgata tcctgaagtc egtgttcttg atcttcccac atttttgcct gggacgaggg 2280 

ctcatcgaca tggtgaaaaa ceaggcaatg gctgatgccc tggaaaggtt tggtgagtga 2340 

ageagtggct gtaggatgct ttaatggaga tggcactctg cataggcctt ggtaccctga 2400 

actttgtttt ggaaagaagc aggtgactaa gcacaggatg ttcccccacc cccatgccca 2460 

gtgacagggc tcatgccaac acagctggtt gtggcatggg tcttgtgaca caaccatttg 252 0 

tctgtgtctc tgacagcatt gagaaaagtg aaagggcagt tttgaaggta aggaaaatag 23 80 

tgfctatttgc ttggatccac tggctcatgc cactgtctgg gttggttaga agcactggaa 264 0 

aagtcaaacc ataactttga gaattaggtg atcagggaat cagaaggaaa gatgcaaact 2700 

ttggctcttt taggcgaatc atgtgcctgc agatgaggtc atttattatc ttttacacag 2760 

tetataaaat tataatgtat tacatetttt tctaccttta gaatggttaa aaatatttct 2820 

ccggtagcca tatgattatt attcatccat tagataatat agtcaaatgg gecatgttat 2880 

ctactgttca tagaagaggg gctttttgca acttgggcta caaaggagat atgtaaggaa 2 94 0 

tttaaggaat ggttacatgg aactagattt aattgaatct agtggtttaa ttgattcact 3 000 

aggatatatg ctactgaaag gggaatctgc ttaaagtgct ttctgatatt tattattact 3060 

aaaacttaga atttattaaa aatactgact gtgaaaatta cttgggtcgt ttgccttttt 3120 

aaaaggattt ttggcatgtc tcattaaaaa aagaaatact agatatcttc agtgaagtta 3180 

caaatcgaat acacattggc tctgaaattc tgattgatac tgggtcataa aaagttttcc 3 24 0 

caaatcagac ttggaaagtg atcactctct tgttactctt ttttccttgt catgggtgat 33 00 

agccatttgt gtttattgga agatcggtga attttaagga acataggccc aaatttgagg 3 3 60 

aagggccatg gtttttgatc cctccattct gaccggatct ctgcattgtg tctactaggg 3420 

gagaatcgct ttgtgtcacc attatcttgg gacttggtgg gacgaaacct cttcgccatg 3480 

gccgtggaag gggtggtgtt cttcctcatt actgttctga tccagtacag attcttcatc 3540 

aggcccaggt: gagctrttttc ttagaacccg tggagcacct ggttgagggt cacagaggag 3600 

gcgcacaggg aaacactcac caatgggggt tgcattgaac tgaactcaaa atatgtgata 3 660 

aaactgattt tcctgatgtg ggcatcccgc agccccctcc ctgcccatcc tggagactgt 372 0 

ggcaagtagg ttfctataata ctacgttaga gactgaatct ttgtcctgaa aaatagtttg 3 780 

aaaggttcat ttttcttgtt ttttccccca agacctgtaa atgcaaagcit atctcctcfcg 3 840 

aatgatgaag atgaagatgt gaggcgggaa agacagagaa ttcttgatgg tggaggccag 3 900 

aatgacatct tagaaatcaa ggagttgacg aaggtgagag agtacaggtt acaatagctc 3 960 

atcttcagtt ttfcttcagct ttatgtgctg taacccagca gtttgctgac ttgcttaata 4020 

aaagggcatg tgttcccaaa atgtacatct ataccaaggt tctgtcaatt ttattttaaa 4080 

aacaccatgg agacfctctta aagaattcct actgagaatt cttttgtgat atgaattccc 4140 

attctcgaat actttggttt tatatgctta catttatgtg ttagttatta aaacatacta 4200 

atattgtata tctagtcaaa ctgagtagag agataatggt gatt 4244 

<210> 28 
<211> 5023 
<212> DNA 

<213> Homo sapiens 
<400> 2a 

ttttaaaata cctgcaatac atatatatgt tgaatagatg aaaaattatg tagatgataa 60 

tgaatgatac ggttctaaaa agacaggtta aaaagtaagt tcacttfctat tttgagcttc 120 

agaatcattc agaagccagt cgccacaaac gcagaccaag gctctrggca catcaaatat 180 

gcctatggct tagggttatt gacaagtctt atgttgcagt gtatgtggtt tatagtcctg 24 0 

ccttccacag ttgcttggga gagctgtgag tcactgaggc ttatgaatgt ttacattttg 300 
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tttgttgcag atatatagaa ggaagcggaa gcctgctgtt gacaggattt gcgtgggcat 360 

tcctcctggt gaggtaaaga cactttgtct atattgcgtt tgtccctatt agttcagact 420 

atctctaccc aatcaagcaa cgatgctcgt taagaggtaa aagtggattt taaaggcttc 480 

tgtatttatg ccaggatgga gcaattagtc atcgagaaga gagggaccct gtatgtcaag 540 

agaat:gattt cagagaatqc aatacaattt aagaaaaagc atggggctgg gcgcagtgat 600 

tcactcctgt aatcccagca ctttgggagg ccgaggtggg cggactcacg aggtcaggag 660 

attgagacca tcctggccaa catggtgaaa ccccatictct actataaata caaaaattag 720 

ctgggcafcag tagtgcattc ctgtagtccc agctactcgg gaggctgagg caggagaatt 780 

gcttgaacct aggaggggga ggttgcccag attgcgctgc tgcacfcccag cctggtgaca 84 0 

gagtgagact catgtcaacd acaaaaacag aaaaagcacg cacatctaaa acatgctttt 900 

gtgatccatt cgggatggtg aUgacattca aatagt:tt:tt taaaaataga ttttctcctt 960 

tctggtttcc gtttgtgttc ttttatgcce ttttgccaga gtaggtggtg caatttggct 1020 

agctggcttt cattactgtt tttcacacat taactttggc ctcaacttga caactcaaac 1080 

aatiatttata aatacagcca cacttaaaat ggtcccatta tgaaatacat atttaaatat: 1140 

ctatacgatg tgttaaaacc aagaaaatat ttgattcttc tctgatattt aagaattgaa 1200 

ggtttgaggt agttaegtgt taggggcatt tatattcatg tttttagagt ttgcttatac 1260 

aacttaatct ntccttttca gtgctttggg ctcctgggag ttaatggggc tggaaaatca 1320 

tcaactttca agatgttaac aggagatacc actgttacca gaggagatgc tttccttaac 1380 

aaaaataggt gagaaaagaa gtggcttgta ttttgctgca aagactttgt ttttaattta 1440 

tttaaagaaa taggttgtta tttttgatfca cagtggtatt tttagagttc ataaaaatgt 1500 

tgaaatatag t:aaagggtaa agaagcacat aaaatcatcc atgatttcaa tatctagaga 1560 

taatcacaat ttacatttcc tttcagtctc attctcttct tttaacagct ttattcaggt 1620 

ataatttaca tacaatataa tttgcttgtt ttttaagagt ataatttagt gatttttggt 1680 

aaattgagag ttttgcaacc atcaccacaa tccagtttta gaact:tttcc atcaccccac 1740 

atctgtctta tatacacat:a taaatgtgcc atacaat:t:ga gatcatactg tatgtagaat 1800 

ttaaaattag tttttattgt taatgagtgt attatgaata tttcccagtg ggttacattt i860 

cctaagatgt ggaattttac attgctacat aaaatccccc tatgtacatg tacctataat 1920 

ttatttaata aattccttat aaatgttgga cacattagtt tccatttttc actatgtaaa 1980 

tatgtccctg tatacatctt ttattatttc ctcaggaaca attcctacaa agtaaattgc 204 0 

cctctctaaa gagcatacaa attgactgag ccaccgttag gccattttct gagactgcac 3100 

aggtcacaaa gcaatctgat ctttgggaat acagctacat tttataggct tcttagataa 2160 

tgttactcta agtactttaa atatgtgggg cttctctggg cttttttttt tttgagacgg 2220 

agfcttcactc ttactgccca ggctggagag caatggcgcg accttggctc actgcaacct 2280 

ccgcctccca ggttcaagcg attctcctgc ctcagcctcc tgagtagctg agattacagg 2340 

tgcccgccac aatgcctgcc taattttttt gtattttcag tagagatggg gtttcaccat 2400 

gttggccaga ctggtctcga gctcctgacc tcaggtgatc cacctgcctc agcctcccaa 24 60 

agttctggga ttacaggcat gagtscactgc gcccggcttc tctggactta ttatgtggag 2520 

agatagtaca aggcagtggc tttcagagtt ttttgaccat gaccgttgtg ggaaatacat 2580 

tttatatctc aacctagtat gtacacacag acatgtagac acatgfcataa cctaaagttt 2640 

cataaagcag tacctactgt tactaattgt agtgcactct gctatttctt attctacctt 2700 

atactgcgtc attaaaaaag tgctggtcat: gacccactaa atttatttcc caaaccacta 2760 

atgaacaatg actcacaatt tgaacacact ggacaggggg atagcaaata aaattgaaaa 2820 

gagcaaggaa attaatgtat tcatgatctc ctctcctgtc tcttacattt ttgcagtagc 2880 

aatgtaaagg aatcctaaga gaacagacat tctgggaata gcaggcctag cgctgcacaa 2940 

ctgctttcct aggcttgcfcc ctagtaccaa gctcctgacg catatagcag tggcagtaat 3000 

aaccagccca tagtaaggtt tgtcacaggg actggttgta agaactgatt tgrttggtat 3060 

agctgtgagg gcctggcacg gtgtccacgt gtgcctcaat cctaattctg aaaaaggctg 3120 

accctggggg tgctaattag atacacagag aggaatgaat gctgccagaa ggccaagttc 3180 

atggcaatgc cgctgtggct gaggtgcagt catcagtctg gaacgtgaac actgaacttc 324 0 

tctcacatgt gattcttcac ttgactggct tcatagaacc ccaaagccac cccaccacca 3300 

cataaattgt gtctctaggt tctgtgttgc tcacactcaa aatttctggg ccttctcatt 3 36C 

tggtgcatgt gaatggtgca tatgagtgaa gtctaggatg gggccttagc gttaaagccc 3420 

tggggt-agtg tgactgagat tgttggtaaa gaatgtgcag tggttggcat gacctcagaa 3480 

attctgaaat gggactgcac ctgcagactg aagtgttcag agagccaggg aggtgcaagg 354 0 

actggggagg gtagaggcag gaaccctgcc tgccaggaag agctagcatc ctgggggcag 3600 



wo 00/55318 



PCT/lBOO/00532 



38 



aaaggctgtg ctttcaagta gcagcagatg tattggtatc tttgtaatgg agaagcatac 3660 

tttacaggaa cattaggcca gattgtctaa ccagagtatc tcticctgct taaaatctaa 3720 

gtagttttct tgtcctttgc agtatcttat caaacatcca tgaagtacat cagaacatgg 3780 

gctactgccG tcagtttgat gccatcacag agctgttgac tgggagagaa cacgtggagt 3840 

tctttgcccfc tttgagagga gtcccagaga aagaagttgg caaggtactg tgggcacctg 3900 

aaagccagcc tgtctccttt ggcatcctga caatatatac cttatggctt ttccacacgc 3960 

attgacttca ggctgttttt cctcatgaat gcagcagcac aaaatgctgg ttctttgtat 4020 

ctgctttcag ggtggaaacc tgtaacggtg gtggggcagg gctgggtggg cagagaggga 4080 

gtgctgcJtcG caccacacga gtcccttctc cctgctttgg ctcctcacca gttgtcaggt 414 0 

tatgattata gaatctagtc ctactcagtg aaagaacttt cataeatgta tgtgtaggac 4200 

agcatgataa aattcccaag ccagaccaaa gtcaaggtgc tttttatcac tgtaggttgg 4260 

tgagtgggcg attcggaaac tgggcctcgt gaagtatgga gaaaaatatg ctggtaacta 4320 

tagtggaggc aacaaacgca agctctctac agccatggct ttgatcggcg ggcctcctgt 43 80 

ggtgtttctg gtgagtataa ctgtggatgg aaaactgttg ttctggcctg agtggaaaac 444 0 

atgactgttc aaaagtccta tatgtccagg gctgttgtat gattggcttg tcttccccca 4500 

gggacagcag agcaaccttg gaaaagcaga gggaagcttc tcccttggca cacactgggg 4S60 

tggctgtacc atgcctgcag atgctcccaa atagaggcac tccaagcact ttgtttctta 462 0 

gcgtgattga ggctggatat gtgatttgat ctttctctgg aacattcttt ctaatcatct 4680 

ttgtgttcat tccctgaaaa tgaagagtgt ggacacagct ttaaaatccc caaggtagca 474 0 

actaggtcat agttccttac acacggatag atgaaaaaca gatcagactg ggaagtggcc 4 300 

cttgaccttt tttcttctgt agataagagc attgatgtta ttacgggaag aagcctttga 436 0 

ggcttttatg tattccacct cggtctggaa tttgtttctg taaggctaac agttgcaata 492 0 

tactagggta atctgagtga gctggaatta aaaaaaaaaa ggaatttcac cccaatctta 49B0 

tactgacttc aatagaggtt tcagac3aaaa agttgttttg tat 502 3 



<210> 29 

<211> S138 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> miac_f eature 
<222> (X),..(S13a) 

<223> n » a, t, c, or g 

<400> 29 

ngccnngttn aaaangaaaa tttnnnnnaa attnaanntt annggngnim tttccccaga 60 

aaaaacnaaa angatttccn cccngggggg ncccccnant cnaaaaggcc ccncttnttt 120 

gnggngaggg aaagnttttt ttggaatttt taatttttgg tcccccaaaa cctattattg 180 

agaatttaat tacataaaaa agtactcaga atatttgagt ttcctgcatc aataagacat 240 

ttataataat gaccttgttt acaaatgaat tfcgaaagtta ctctaattct ttgattcatc 300 

aagaaataac tagaatggca agttaaaatt taagctgttt caaagatgct tctgcattta 3 60 

aaaacaaatt tatctttgat tttttttccc cccagcaaat aagacttatt ttattctaat 420 

tacaggatga acccaccaca ggcatggatc ccaaagcccg gcggttcttg tggaattgtg 4 80 

ccctaagtgt tgtcaaggag gggagatcag tagtgcttac atctcatagg tccgtagtaa S4 0 

agtcttgggt tcctcactgt gggatgtttt aactttccaa gtagaatatg cgatcatttt 600 

gtaaaaatta gaaaatacag aaaagcaaag agtaaaacaa ttattacctg aaattatata 660 

tgcatattct tacaaaaatg caagcccagt ataaatactg ctctttttca cttaatatat 72 0 

tgtaaacatt attccaagtc agtgcattta ggtgtcattt cttatagctg gatagtattc 780 

cattaggata tactcttatt taactattcc cccttttgta gacatttgga ttatttccaa 84 0 

cttgttcaca attgtaaaca ccactacact gaacagcatc atccctatat ccacatgtac 900 

ttgtaacaga atacaattcc ctaggaagct ggaatgctgg aagtcatggt gatgttctca 960 

tggttacaga gaatctctct aaaactaaaa cctctttctg ttttaccgca gtatggaaga 1020 
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atgtgaagct ctttgcacta ggatggcaat catggtcaat ggaaggttca ggtgcettgg 1080 

cagtgtccag catctaaaaa ataggtaata aagataattt ctttgggata gtgcctagtg 1140 

agaaggcttg atatttattc ttttgtgagt atataaatgg tgcctctaaa ataaagggaa 12 00 

ataaaactga gcaaaacagt anagtggaaa gaatgagggc tttgaagtcc gaactgcatt 12 SO 

caaattctgt ctttaccatt tactggttct gtgactcttg ggcaagttac ttaactactg 1320 

taagagttag tttccctgga agatctacct cctagctttg tgctatagat gaaatgaaaa 13 80 

aaatttacat gtgccagtac tggtgagagc gcaagctttg gagtcaaaca caaatgggtt 1440 

tgcatcctgg ccctaccaat tatgagctct gagccatggg caagtgacta actccctggg 1500 

cctcagtttc tctgtaacat ctgtcagact tcatgggtcc aggtgaggat taaaggagat 1560 

catgtattta cagcacatgg catggtgctt cacataaaat aagtatttag taaatgataa 1620 

ctggttectt ctctcagaaa cttatttctg ggcctgccag gggccgccct ttttcatggc 168 0 

acaagttggg ttcccagggt tcagtattct tttaaatagt tttctggaga tcctccattt 1740 

gggtattttt tcctgctttc aggtttggag atggttatac aatagttgta cgaatagcag 1800 

ggtccaaccc ggacctgaag cctgtccagg atttctttgg acttgcattt cctggaagtg 1860 

ttcyaaaaga gaaacaccgg aacatgctac aataccagct tccatcttca ttatcttctc 1920 

tggccaggat attcagcatc ctctcccaga gcaaaaagcg actccacata gaagactact 1980 

ctgtttctca gacaacactt gaccaagtaa gctttgagtg tcaaaacaga tttacttctc 2040 

agggtgtgga ttcctgcccc gacactcccg ccscataggcc caagagcagt ttgtatcttg 2100 

aattggtgct tgaattcctg aCctactatt cctagctatg ctttttacta aacctctctg 21S0 

aacctgaaaa gggagatgat gcctatgtac tctataggat tattgtgaga atttactgta 2220 

ataataacca taaaaactac catttagtga gcacctacca tgggccaggc attttacttg 22 80 

gtgcctaatc ctatttaaat tagataaaaa agtaccaaat aggtcctgac aettaagaag 2340 

tactcagtaa atattttctt ccctcttccc tttaatcaag accgtatgtg ccaaagtaaa 2400 

tggatgactg agcagttggt gatgtagggg tggggggcga tatagaaagt cagtttttgg 2460 

ccgggcgtgg tggcteatgc ctgtaatccc agcactttgg gaggctgagg agcaggcaga 2520 

tcatgaggtc aggagatcca gataatcctg gccaacaggg tgaaaccccg tctctactaa 2580 

aaatacaaaa attagctggg catggtggtg cgcacttgta gtcccagcta cttgcgaggc 2640 

tgaggcagga gaattgctcg aacccaggag gtggaggtta cagtgagcca aggtctcgcc 2700 

actgcactcc agcctgggga cagagcaaga ccccatttca aggggggaaa aaaagtctat 2760 

ttttaagttg ttattgcttt tttcaagtat tcttccctcc ttcacacaca gttttctagt 282 0 

taatccattt atgtaattct gtatgctcct acttgaccta atttcaacat ctggaaaaat 2880 

agaactagaa taaagaatga gcaagttgag tggtatttat aaaggtccat cttaatcttt 2 940 

taacaggtat ttgtgaactt tgccaaggac caaagtgatg atgaccactt aaaagacctc 3000 

tcattacaca aaaaccagac agtagtggac gttgcagttc tcacatcttt tctacaggat 3060 

gagaaagtga aagaaageta tgtatgaaga atcctgttca tacggggtgg ctgaaagtaa 3120 

agaggaacta gactttcett tgcaccatgt gaagtgttgt ggagaaaaga gccagaagtt 3180 

gatgtgggaa gaagtaaact ggatactgta ctgatactat tcaatgcaat gcaattcaat 3240 

gcaatgaaaa caaaatteca ttacaggggc agtgccttng tagcctatgt cttgtatggc 3300 

tctcaagtga aagacttgaa tttagttttt tacctatacc tatgtgaaac tctattatgg 33 60 

aacccaatgg acatatgggt ttgaactcac actttttttt ttttttttgt tectgtgtat 3420 

tctcattggg gttgcaacaa taattcatca agtaatcatg gccagcgatt attgatcaaa 3480 

atcaaaaggt aatgcacatc ctcattcact aagccatgcc atgcccagga gactggtttc 3540 

ccggtgacac atceattgct ggcaatgagt gtgccagagt tattagtgcc aagtttttca 3 600 

gaaagtttga agcaccatgg tgtgtcatgc tcacttttgt gaaagctgct ctgctcagag 3 660 

tctatcaaca ttgaatatca gttgacagaa tggtgceatg cgtggctaac atcctgcttt 3 720 

gattccctct gataagctgt tctggtggca gtaacatgca acaaaaatgt gggtgtctcc 3 78 0 

aggcacggga aacttggttc cattgttata ttgtcctatg cttcgagcca tgggtctaca 3 840 

gggtcatcct tatgagactc ttaaatatac ttagatcctg gtaagaggca aagaatcaac 3 900 

agccaaactg ctggggctgc aactgctgaa gccagggcat gggattaaag agattgtgcg 3 960 

ttcaaaccta gggaagcctg tgcccatttg tcctgactgt ctgctaacat ggtacactgc 4 02 0 

acctcaagat gtttatctga cacaagtgta ttatttctgg ctttttgaat taatctagaa 4080 

aatgaaaaga tggagttgta ttttgacaaa aatgtttgta ctttttaatg ttatttggaa 414 0 

ttttaagttc tatcagtgac ttctgaatcc ttagaatggc ctctttgtag aaccctgtgg 42 0 0 

tatagaggag tatggccact gcccactatt tttattttct tatgtaagtt tgcatatcag 4260 

tcatgactag tgcctagaaa gcaatgtgat ggtcaggatc tcatgacatt atatttgagt 432 0 
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ttctttcaga tcatttagga tactcttaat ctcacttcat caatcaaata ttttttgagt 4380 

gtatgctgta gctgaaagag tatgtacgta cgtataagac tagagagata ttaagtctca 444 0 

gtacacttcG tgtgccatgt tattcagctc actggtttac aaatataggt tgtcttgtgg 450 0 

ttgtaggage ccactgtaae aatactgggc agcctttttt tttttttttt taattgcaac 4560 

aatgcaaaag ccaagaaagt ttaagggtca caagtctaaa caatgaattc ttcaacaggg 4S20 

aaaacagcta gcttgaaaac ttgctgaaaa acacaacttg tgtttatggc atttagtacc 4680 

ttcaaataafc tggctttgca gatattggat accccattaa atctgacagt ctcaaatttt 4740 

tcatctctitc aatcactagt caagaaaaaa tataaaaaca acaaatactt ccatatggag 4800 

catttttcag agtttfcctaa cccagtctta tttttctagt cagtaaacat ttgtaaaaat 4860 

actgtttcac taatacttac tgttaactgt cttgagagaa aagaaaaata tgagagaact 492 0 

attgtttggg gaagttcaag tgatctttca atatcattac taacttcttc cactttttcc 4 980 

agaatttgaa tattaacgct aaaggtgtaa gacttcagat ttcaaattaa tctttctata 504 0 

ttttttaaat ttacagaata ttatataacc cactgctgaa aaagaaacaa atgattgttt 5100 

tagaagttiaa aggtcaatat tgattittaaa atattaag 513 8 



<210> 30 

<211> 20 

<212> D3DIA 

<213> Homo sapiens 

<400> 30 

gtgttcctgc agagggcatg 20 

<210> 31 

<:211> 20 

<212> DNA 

<:213> Homo sapiens 

<400> 31 

cacttccagt aacagctgac 2 0 

<21Q> 32 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 32 

ctttgcgcat gtccttcatg c 22 

<210> 33 

<21i> 21 

<212> DNA 

<213> Homo sapiens 

<400> 33 

gacateagcc ctcagcatct t 21 

<210> 34 

<211> 19 

<212> DNA 

•c213> Homo sapiens 



<400> 34 
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caacaagcca tgttccctc X9 

<210> 3S 

<211> 18 

<212> DWA 

<2X3> Homo eaplens 

<400> 35 

catgttccct cagccagc la 

c210> 36 

<211> X9 

<212> DNA 

<2X3> Homo sapiens 

<400> 36 

cagagctcac agcagggac 19 

<210> 37 
<211> 21 
<212> PRT 

<213> Homo sapiens 



<400> 37 

Cys Ser Val Arg Leu Ser Tyr Pro Pro Tyr Glu Gin His Glu Cya His 

15 10 15 

Phe Pm Asn Lys Ala 
20 



<210> 38 

<211> 14 

<212> DNA 

<213> Homo sapiens 



<400> 38 

gcctgtgtgt cecc 14 

<210> 39 

<211> 14 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis Cofeature 
<222> (1) .7. (14) 
<223> n a t or c 



<400> 39 

gcctgtgngt cccc 14 

<210> 40 

<211> 45 

<212> DNA 

<213> Homo sapiens 
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<400> 40 

aagaagatgc tgcctgtgtg tcccccaggg gcaggggggc trgcat 45 

<210> 41 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 41 

Lys Lys Met Leu Pro Val Cya Pro Pro Qly Ala Gly Gly Leu Pro 
15 10 15 



<210> 42 

<211> 15 

<212> PRT 

<213> MU3 rausculus 



<400> 42 

Lya Lys Met Leu Pro Val Cys Pro Pro Gly Ala Gly Gly Leu Pro 
1 5 10 IS 

<210> 43 

<211> 15 

<212> PRT 

<213> Homo sapiens 



<400> 43 

Lys Lys Met Leu Pro Val Arg Pro Pro Gly Ala Gly Gly Leu Pro 
1 5 10 15 



<210> 44 
<211> 5 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 44 

Leu Leu Gly Gly Ser 
1 5 

<210> 45 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 45 

aagaagatgc tgcctgtgcg tcccccaggg gcaggggggc tgcct 45 



<210> 46 

<211> 14 

<212> DNA 

<213> Homo sapiens 



<400> 46 
gcctacttgc agga 



14 
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<210> 47 
<211> 14 
<212> DNA 

<213> Homo sapiens 
<400> 47 

gcctacttgc ggga 14 

<2X0> 48 
<211> 45 
<212> DNA 

<213> Homo sapiens 

<400> 48 

tgggggggct: tcgcctactt gcaggatgtg gtggagcagg caatc 4S 

<2L0> 49 
<211> 15 
<2L2> PRT 

<213> Homo sapiens 
<4Q0> 49 

Trp Gly Gly Phe Ala Tyr Leu Gin Asp Val Val Glu Gin Ala lie 
1 5 10 15 

<210> 50 
<211> 15 
<212> PRT 

<213> Mus musculus 
<400> 50 

Trp Gly Gly Phe Ala Tyr Leu Gin Asp Val Val Glu Gin Ala He 
1 5 10 IS 

<210> 51 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<460> SI 

Trp Gly Gly Phe Ala Tyr Leu Arg Asp Val Val Glu Gin Ala He 
1 5 10 15 

<210> 52 
<211> 12 
<212> PRT 

<213> Caenorhabditis elegans 
<;400> 52 

Phe Met Thr Val Gin Arg Ala Val Asp Val Ala He 
1 5 10 



c21G> 53 
<211> 45 
<212> DNA 
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<213> Homo sapiens 
<400> 53 

tgggggggct tcgcctactt gcgggatgtg gtggagcagg caatc 45 

<210> 54 

<211> 2S 

<212> DNA 

<213> Homo sapiens 

<220> 

<22X> misc^featiare 

<222> (1) . . . <25) 

<223> n is a, t, or g. 

<400> S4 

tcattcctct tgtimgcncn giincn 25 

<210> 55 
<211> 45 
<212> DNA 

<213> Homo sapiens 
<400> 55 

agtagcctca ttcctcttct tgtgagcgct ggcctgctag tggtc 4 5 

<210> 56 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 56 

Ser Ser Leu lie Pro Leu Leu Val Ser Ala Gly Leu Leu Val Val 
IS 10 15 

<210> 57 
<211> 15 
<212> PRT 

<213> Mus musculus 
<400> 57 

Ser Ser Leu He Pro Leu Leu Val Ser Ala Gly Leu Leu Val Val 
1 5 10 15 

<210> 58 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 58 

Ser Ser Leu He Pro Leu Val Ser Ala Gly Leu Leu Val Val 
1 5 XO 



<210> 59 
<2X1> 15 
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<212> PRT 

<213> Caenorhabditis elegaas 



<400> 59 

lie Asn Tyr Ala Lyg Leu Thr Phe Ala Val lie Val Leu Thr lie 
1 5 10 IS 



<210> 60 
<21X> 42 
<212> DNA 
<213> Homo 



sapiens 



<400> 60 

agtagcctca ttcctcttgt gagcgctggc ctgctagtgg tc 42 

<210> 61 

<211> 25 

<212> DWA 

<213> Homo sapiens 



<220> 

<221> misc^^f eature 

<222> (1) . . • (25) 

<223> n is a, t, c, or g. 

<400> 61 

tgatgaagat ganarmcngn ngcga 25 

<210> 62 

<211> 36 

<212> DNA 

<213> Homo sapiens 

<4Q0> 62 

aafcgatgaag atgaagatgt gaggcgggaa agacag 36 



<210> 63 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 63 

Aan Asp Glu Asp Glu Asp Val Arg Arg Glu Arg Gin 
1 5 10 



<210> 64 
<211> 12 
<212> PRT 

<213> Mus rauBCulus 
<400> 64 

Asn Asp Glu Asp Glu Asp Val Arg Arg Glu Arg Gin 
1 5 10 



<210> 65 
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<211> 10 

<212> PUT 

<213> Homo sapiens 

<400> 65 

Aan Aap Glu Asp Val Arg Arg Qlu Arg Gin 
I S 10 

<210> 66 
<211> 15 
<212> PRT 

<213> Caenorhabditis elegans 
<4oo> 

Asp Glu Arg Asp Val Glu Asp Ser Asp Val He Ala Glu hys Ser 
15 10 IS 

<210> 67 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 67 

aatgatgaag atgtgaggcg ggaaagacag 

<210> 68 

<211> 14 

<212> DNA 

<213> Homo sapiens 

<400> 68 
agttgtacga atag 

<210> 69 

<211> 14 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) , , . (14) 
<223> n is t or c. 

<400> 69 
agttgtanga atag 

<210> 70 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<:400> 70 

ggctggatta gcagtcctca 



<210> 71 
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<211> 20 

<212> DHA 

<213> Homo sapiens 



<40a> 71 

ggatttccca gatcccagtg 



20 



<210> 72 

<211> 20 

<212> DKA 

<213> Homo sapiens 

<400> 72 

gacagacttg gcatgaagca 20 

<210> 73 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 74 

<211> 20 

<212> DWA 

<213> Homo sapiens 

<400> 74 

cgtttctcca ctgtcccatt 2 0 

<210> 75 

<21X> 20 

<212> DITA 

<213> Homo sapiens 



<2X0> 76 

<211> 24 

<212> DNA 

<2X3> Homo sapiens 

<400> 76 

tcggtttctt gtttgttaaa ctca 2 4 

<2X0> 77 
<21X> 2 0 
<212> DNA 

.<2X3> Homo sapiens 



<400>. 73 

gcacttggca gtcacttctg 



20 



<400> 75 

acttcaagga cccagcttcc 



20 



<400> 77 

tcccaaggct ttgagatgac 



20 
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<21Q> 78 

<211> 19 

<212> DMA 

<213> Homo sapiens 

<400> 78 

ggctccaaag eccttgtaa 

<210> 79 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 79 

gctgctgtga tggggtatct 

<210> 80 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 80 

tttgtaaatt ttgcagtgct cctca 

<210> 81 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 81 

tagtcagccc ttgcctccta 

<2X0> 82 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<40O> 82 

aaaggggctt ggtaagggta 

<210> 83 

<211> 20 

<212> 0NA 

<213> Homo sapiens 

<400> 83 

gatgtggtgc tccctctagc 

<210> 84 

<211> 20 

<212> DMA 

<213> Homo sapiens 

<400> 84 

caagtgagtg cttgggattg 
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<210> 85 

<211> 21 

<212> DTSSh 

<213> Homo sapiens 

<400> 85 

gcaaat:t:caa atttctccag g 21 

<210> as 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 87 

<211> 20 

<212> Dm 

<21.3> Homo sapiens 

<400> 87 

ctgaaagttc aagcgcagtg 20 

<210> 88 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 88 

tgcagactga atggagcatc 20 

<210> 89 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 90 

<211> 20 

<212> DMA 

<213> Homo sapiens 

<400> 90 

aggtcctctg ccttcactca 20 

<210> 91 

<:211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 86 

tcaaggagga aat:ggacct:g 



20 



<:400> 89 

gccaggggac actgtattct 



20 



<400> 91 

Gcagtgctta cccctgctaa 



20 
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<210> 92 
<211> 21 
<212> D'SSA 

<213> Homo sapiens 
<400> B2 

cacacaacag agcttcttgg a 21 

<210> 93 

<211> 20 

<212> DN*A 

<213> Homo sapiens 



<210> 94 

<211> 21 

<212> DWA 

<213> Homo sapiens 

<400> 94 

gggctaacat gccactcagt a 21 

<210> 95 

<211> 20 . 

<212> DWA 

<2X3> Homo sapiens 



<210> 96 

<211> 20 

<212> DWA 

<213> Homo sapiens 

<400> 96 

caccagaaga aggagcatgg 2 0 

<210> 97 

<21l3. 20 

<212> DWA 

<213> Homo sapiens 



<400> 93 

acctggaaca ggtgtggtgt 



20 



<4Q0> 95 

gtttgttgca gatggggaag 



20 



<400> 97 

ctggactcgt agggatttgc 



20 



<210> 98 
<211> 21 
<212> DNA 



<213> Homo sapiens 



<400> 98 

gcctgtcaca gagaaatgct t 



21 
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<210> 99 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 99 

ttacggaatg atcctgtgct c 21 

<210> 100 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 101 

<211> 22 

<212> DNA 

<2X3> Homo sapiens 

<400> 101 

ccgttcctta tatcctcagg tg 22 

<210> 102 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<210> 103 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<40a> 103 

tgttgtccac aggttccaga 20 

<210> X04 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 104 

tgaggtttat gggcatggtt 20 

<210> 105 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 100 

agteaggttt ccggtcacac 



20 



<400> 102 

ccttgtacac actcgcactg a 



21 



<400> 105 

atgtttttcc ttggctgtgc 



20 
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<210> 106 

<211> 20 

<212> DNA 

<2X3> Homo sapiens 



<40O> lOS 

atctgccctt tcttgtctga 



20 



<2ao> X07 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 107 

agggagctgc acagtggata 



20 



<210> 108 

<211> 24 

<21-2> DNA 

<2X3> Homo sapiens 



<40O> 108 

tcactcccat atttcagaac ttga 



24 



<210> 109 

<211> 22 

<212> DHA 

<213> Homo sapiens 



<400> 109 

tgtttattgg aagatcggtg aa 



22 



<210> 110 

<211> 25 

<212> DMA 

<213> Homo sapiens 



<4ao> 110 

cgttagagac tgaatctttg tcctg 



25 



<210> 111 

<211> 20 

<212> DMA 

<213> Homo sapiens 



<400> 111 

agtcctgcct tccacagttg 



20 



<210> 112 
<211> 21 
<212> DMA 

<213> Homo sapiens 



<400> 112 

ggtagttacg tgttaggggc a 



21 
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<210> 113 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> X13 

caggaacatt aggccagatt g 

<210> 114 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 114 

catgtatgtg taggacagca tga 

<210> lis 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 115 

ctgttfccaaa gatgcttctg c 

<210> 115 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> lis 

cctaggaagc tggaatgctg 

<210> 117 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 117 

gggttcccag ggttcagtat 

<210> lis 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<4aa> 118 

cttgacctaa tttcaacatc tgg 

<:210> 119 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 119 

atccccaact. caaaaccaca 
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21 



23 



21 



20 



20 



23 
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<210> 120 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 120 

aagtccaatt tagcccacgt t 

<210> 121 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 121 

ccagccattc aaaattctcc 

<210> 122 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 122 

ggtgcaggtc aatttccaat 

<210> 123 
<211> 20 
<212> DNA 

<213> Homo sapiens 



21 



20 



20 



<400> 123 

ccccttcacc accattacaa 



20 



<210> 124 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 124 

tgtccaagga aaagcctcac 



20 



<210> 125 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 125 

aggacetctt gccagactca 



20 



<210> 126 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 12S 

siS^agatgac acaggccaag 



20 
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<210> 127 

<211> 20 

<2X2> DNA 

<.213> Homo sapiens 

<400> 127 

cgcacacctc tgaagctacc 

<2X0> 128 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 123 

acctcactca cacctgggaa 
<210> 129 

<211> 20 ^ 

<212> DNA 

<213> Homo sapiena 

<400> 129 

gcctcctgcc tgaaccttat 

<210> 130 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 130 

caaaatcatg acaccaagtt: gag 

<210> 131 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<4005- X31 

catgcacatg cacacacata 

<210> 132 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 132 

ecttagcccg tgttgagcta 

<210> 133 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 133 

tgcttctatt cagggactcc a 
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20 



20 



20 



2 3 



20 



20 
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<210> 134 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 134 

cccatgcact gcagagattc 2 0 

<210> 135 

<211> ISf 

<212> DNA 

<213> Homo sapiens 

<400> 135 

aaggcaggag acatcgctt 19 

<210> 136 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 136 

gggatcagca tggtttccta 2 0 

<210> 137 

<2H> 20 

<212> DNA 

■c213> Homo sapiens 

<400> 137 

gcttaagtcc cactcctccc 20 

<210> 139 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 138 

attttcctcc gcatgtgtgt 2 0 

<210> 139 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 139 

tcacagaagc ctagccatga 2 0 

<210> 140 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<400> 140 

aacagagcag ggagatggtg 



20 
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<210> X41 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 141 

tctgcacctc tcctcctctg 

<;210> 142 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 142 

actggggcca acatt:aat:ca 

<210> 143 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 143 

cttccccatc tgcaacaaac 

<210> 144 

<211> 2 0 

<212> DNA 

<213> Homo sapiens 

<400> 144 

gctaaaggcc atccaaagaa 

<210> 145 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 145 

tcaagtgcat ctgggcataa 

<210> 146 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 146 

tctgaagtcc attcccttgg 

<210> 147 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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20 



20 



20 



20 



20 



<400> 147 

caafcgtggca tgcagttgat 



20 
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<210> 148 

<21X> 19 

<212> DNA 

<213> Homo sapiens 

<400> 148 

gaagctacca gcccatcct 

<210> 149 
<211> 20 
<212> DNA 

<213> Homo sap leas 

<400> 149 

catttceccc actgtttcag 

<210> ISO 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> ISO 

ccaaggcttit cttcaatcca 

<210> ISl 

<;211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 151 

gatccgttta acctgccaac 

<210> 152 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 152 

atgcccctgc caactttac 

<210> 153 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 153 

ctctgcagct gttcccctac 

<210> 154 

<211> 20 

<2125> DNA 

<213> Homo sapiens 
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19 



20 



20 



20 



19 



20 



<400> 154 

tatcaatcca tggccctgac 



20 



wo 00/55318 



<2X0> 155 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 155 

agagtccctg ccctccttct 

<210> 156 

<211> 20 

<212> mtA 

<213> Homo sapiens 

<400> 156 

aaggcagtca gcagtgtcaa 

<aiO> 157 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 157 

£r999aacatc ctgtgcttag 

<210> 158 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 158 

ccattggtga gtgtttccct 

<210> 159 

<211> 20 

<212> DHA 

<213> Homo sapiens 

<400> 159 

agtcagcaaa ctgctgggtt 

<210> ISO 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 160 

attgctccat cctggcataa 

<210> 161 
<211> 23 
<212> DMA 

<213> Homo sapiens 
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20 



20 



20 



20 



20 



<400> 161 

tcatggatga ttttatgtgc ttc 



23 
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<210> 162 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 162 

gcgtgtggaa aagccataag 20 

<210> 163 

<2X1> 20 

<212> DNA 

<213> Homo sapiens 

<400> 163 

gccaatcata caacagccct 2 0 

<210> 164 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<4O0> 164 

tgatcgcata ttctacttgg aaa 23 

<210> 165 

<211> 22 

<212> DNA 

<21.3> Homo sapiens 

<400> 165 

tccctttatt ttagaggcac ca 22 

<210> 166 

<211> 21 

<2X2> DNA 

<213> Homo sapiens 

<400> 166 

gatcaggaat tcaagcacca a 21 

<210> 167 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 167 

tgggttccat aatagagttt caca 24 

<210> 168 

<211> 22 

<212> DNA 

<213> Homo sapiens 



<400> 168 

tgtcagctgt tactggaagt gg 



22 
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<210> 163 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 169 

tgtcagctgc tgcfcggaagt gg 22 

<210> 170 

<211> 21 

<212> DNA 

<213> HoTOo sapiens 

<400> 170 

aggagctggc cgaagccaca a 21 

<210> 171 

<2L1> 21 

<212> DMA 

<213> Homo sapiens 

<400> 171 

aggagctggc tgaagccaca a 21 

<210> 172 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 172 

aatgatgcca ccaaacaaat g 21 

<210> 173 

<211> 21 

<212> DKA. 

<213> Homo sapiens 

<400> 173 

aatgatgcca tcaaacaaat g 21 

<2X0> 174 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 174 

gaggtggctc cgatgaccac a 21 

<210> 173 

<211> 21 

<212> DWA 

<213> Homo sapiens 



<400> 17S 

gaggtggctc tgatgaccac a 



21 
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<210> 176 

<211> 21 

<212> BNA 

<213> Homo sapiens 

<400> 176 

ttccttaaca gaaatagtat c 21 

<210> 177 

<211> 21 

<212> DNA 

<2X3> Homo sapiens 

<400> 177 

ttccttaaca aaaatagtat c 21 

<210> 178 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 178 

ggaagtgttc caaaagagaa a 21 

<210> 179 

<211> 2X 

<212> DNA 

<213> Homo sapiens 

<:400> 179 

ggaagtgttc taaaagagaa a 21 

<210> 180 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 180 

agtaaagagg gactagactt t 21 

<210> 181 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 181 

agtaaagagg aactagactt t 21 

<210> 182 

<211> 21 

<2X2> DNA 

<213> Homo sapiens 



<400> 182 

gcctacttge aggatgtggt g 



21 
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<210> X83 

<211> 21 

<212> DKA 

<2X3> Homo sapiens 

<400> 183 

gcctacttgc gggatgtggt g 

<210> 184 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 184 

cctcattcct cttcttgtga gcg 

<:210> 185 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 185 

cctcattcct cttgtgagcg 

<210> 186 

<211> 21 

<212> DNA 

<2X3> Homo sapiens 

<400> 186 

gcaggactac gtgggcttca c 

<210> 187 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 187 

gcaggactac atgggcttca c 

<210> 188 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 188 

aaaagtctac cgagatggga t 

<210> 189 

<211> 21 

<212> DNA 

<213> Homo sapiens 
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21 



23 



20 



21 



21 



«=400> 189 

aaaagtctac tgagatggga t 



21 
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<210> 190 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 190 

ggccagatca cctcctfccct g 21 

<210> 191 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 19X 

ggccagatca tcitccttcct g 21 

<210> 192 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 192 

acacaccaca tggatgaagc g 21 

<210> 193 

<211> 21 

<:212> DNA 

<213> Homo sapiens 

<400> 193 

acacaccaca cggatgaagc g 21 

<210> 194 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<:400> 194 

Gctggaagaa gtaagttaag t 21 

<210> 195 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 19S 

cctggaagaa ctaagttaag t 21 

<210> 196 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 196 

gctgcctgtg tgtcccccag g 



21 
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<210> 137 

<211> 21 

<212> DNA 

<213:> Homo saplene 

<:400> 197 

gctgcctgtg cgtcccccag g 21 

<210> 198 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 19a 

tagccattat ggaattactg ct 22 

<210> 193 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 199 

tagccattat caattactgc t 21 

<210> 200 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 200 

gatgaagatg aagatgtgag gcggga 26 

<210> 201 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 201 

gatgaagatg tgaggcggga 2 0 

<210> 202 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 202 

aatagttgta cgaatagcag g 21 

<210> 203 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 203 

aatagttgta tgaatagcag g 21 



r 
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<210> 204 

<211> 2X 

<212> DNA 

<213> Homo sapiens 

<400> 204 

acacgctggg ggtgctggct g 21 

<210> 205 

<211> 21 

<212> DNA 

<213>' Homo sapiens 

<400> 205 

acacgctggg cgtgctggct g 21 

<210> 206 

<2ll> 20 

<212> DKTA 

<213> Homo sapiens 

<400> 206 

gaccagccac ggcgtccctg 20 
<210> 207 

<211> 21 , 

<212> DNA 

<2X3> Homo sapiens 

<400> 207 

gaccagccac gggcgtccct g 21 

<;210> 203 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 208 

cattttctta gaaaagagag gt 2 2 

<210> 209 

<211> 22 

<ai2> DNA 

<213> Homo sapiens 

<400> 209 

cattttctta gagaagagag gt 22 

<210> 210 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 210 

gaaaattagt atgtaaggaa g 



21 
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<210> 211 

<2ai> 21 

<212> DNA 

<213> Homo sapiens 



<400> 211 

gaaaattagt ctgtaaggaa g 



21 



<210> 212 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<400> 212 

ccfcccgcctg ccaggttaag cgatt 



25 



<210> 213 
<211> 25 
<212> DNA 

<213> Homo sapiens 



<400> 213 

cctccgcctg ccgggttcag cgatt 



25 



<210> 214 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<400> 214 

tatgtgctga ccatgggagc ttgtt 



25 



<210> 215 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<400> 215 

tatgtgctga ccgtgggagc ttgtt 



25 



<210> 216 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 216 

gtgacaccca acggagtagg g 



21 



<210> 217 
<211> 21 
<212> DNA 

<213> Homo sapiens 



<400> 217 

gtgacaccca gcggagtagg g 



21 
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<210> 218 

<211> 21 

<212> DNA 

<213> Homo sapieas 

<400> 218 

agtatccctt gttcacgaga a 21 

<210> 219 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 219 

agtatccctc ccttgttcac gagaa 25 

<210> 220 

<211> 21 

<212> DNA 

<213> Homo aaplens 

<40a> 220 

ctgggfctcct gtatcacaac c 21 

<210> 221 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 221 

ctgggttcct atatcacaac c 21 

<21Q> 222 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 222 

ggcctaccaa gggagaaact g 21 

<2io> 223 
<2X1> 21 
<2X2> DNA 

<213> Homo sapiens 
<400> 223 

ggcctaccaa aggagaaact g 21 

<210> 224 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 224 

tttaaagggg gtgattagga 



20 
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<210> 225 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 225 

tttaaagggg ttgattagga 20 

<210> 22$ 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 226 

gaagaaattt gtttttttga tt 22 

<210> 227 
<211> 22 
<212> DNA 

<213> Homo sapiens 

<400> 227 

gaagaaattt tttttfcttga tt 22 

<210> 228 

<211> 21 

<212> DWA 

<213> Homo sapiens 



<210> 229 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 229 

gcgggcatcc tgagggaggg g 21 

<210> 230 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 228 

gcgggcatcc cgagggaggg g 



21 



<400> 230 

agggaggggg gctgaagatc a 



21 



<210> 231 
<:2ia> 21 
<212^ DNA 



<213> Homo sapiens 



<400> 231 

agggaggggg actgaagatc a 



21 
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<2X0> 232 

<2X1> 20 

<212> DNA 

<213> Homo sapiens 

<400> 232 

aggagccaaa cgctcattgt 

<2X0> 233 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 233 

aggagccaaa gcgctcattg t 

<210> 234 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 234 

aagccactgt ttttaaccag t 

<210> 235 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 235 

aagccactgt atttaaccag t 

<210> 236 

<211> 21 

<212> DKA 

<213> Homo sapiens 

<40O> 236 

cgtgggcttc acactcaaga t 

<210> 237 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 237 

cgtgggcttc ccactcaaga t 

<210> 238 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 238 

tcacactcaa gatcttcgct g 
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<210> 239 
<ZX1> 21 
<212> DKA 

<213> Homo sapiens 
<400> 239 

tcacactcaa catictuccrct g 

<210> 240 
<211> 21 
<212> DMA 

<213> Homo sapiens 
<400> 240 

gcagcctcac ccgctcttcc c 

<210> 241 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 241 

gcagcctcac tcgctcttcc c 

<210> 242 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 242 

agaagagaat atcagaaatc t 

<210> 243 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<4ao> 243 

agaagagaat gtcagaaatc t 

<210> 244 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 244 

gcgcagtgcc ctgtgtcett a 

<210> 245 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 245 

gcgcagtgcg ctgtgtcett a 
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21 



21 



21 



21 



21 



21 
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<210> 246 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 246 

gatctaaggt tgtcattctg g 21 

<210> 247 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 247 

gatctaaggt ggtcattctg g 21 

<210> 24S 

<211> 23 

<212> DWA 

<213> Homo sapiens 

<400> 248 

cl:.cttct:gt:t agcacagaag aga 23 

<210> 249 

<211> 23 

<212> DNA 

<213> Homo sapiens 



<210> 250 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 2S0 

cattctaggg aticatagcca t 21 

<210> 251 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 249 

ctcttctgtt atlcacagaag aga 



23 



<400> 251 

cattctaggg gtcatagcca t 



21 



<210> 252 
<211> 22 
<212> DNA 



<213> Homo sapiens 



<400> 252 

aagtacagtg ggaggaacag eg 



22 
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<210> 253 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<4.00> 253 

aagtacagtg tgaggaacag eg 

<210> 254 

<211> 22 

<212> DNA 

<2a3> Homo sapiens 

<400> 254 

attcctaaaa aatagaaatg ca 

<210> 253 

<211> 22 

<212> 0NA 

<213> Homo sapiens 

<400> 255 

atticctaaaa agtagaaatg ca 

<210> 256 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 255 

ggcccctgcc ttattattac t 

<210> 257 

<211> 21 

<2X2> DNA 

<2X3> Homo sapiens 

<400> 257 

ggcccctgcc gtattattac t 

<2X0> 2S8 

<2X1> 22 

<212> DNA 

<213> Homo sapiens 

<400> 258 . 

tgagagaatt acttgaaccc gg 

<210> 259 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 259 

tgagagaatt gcttgaaccc gg 
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<2X0> 260 

<211> 21 

<212> DKTA 

<213> Homo sapiens 

<400> 260 

tttgctgaaa caatcactga a 

<2X0> 261 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 261 

tttgctgaaa taatcactga c 

<210> 262 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 262 

aacctcagtt ccctcatctg tg 

<210> 263 

<211> 22 

<212> DWA 

<213> Homo sapiens 

<400> 263 

aacctcagtt tcctcatctg tg 

<210> 264 

<211> 21 

<212> CWA 

<213> Homo sapiens 

<400> 264 

c:tggacacca gaaataatgt c 

<210> 265 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 265 

ctggacacca aaaataatgt c 

<210> 266 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 266 

tcctatgtgt cctccaccaa t 
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<210> 267 
<211> 21 
<212> DMA 

<2X3> Homo sapiens 
<400> 267 

tcctatgtgt gctccaccaa t 

<210> 268 
<211> 21 
<212> DNA 

<213> Homo aapiena 
<400> 263 

aagaagtggc ttgtattttg c 

<210> 269 

<211> 21 

<212> DWA 

<213> Homo sapiens 

<400> 269 

aagaagtggc ctgtattttg c 

<2X0> 270 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 270 

aactgatttg attggtatag ctg 

<210> 271 

<211> 23 

<212> DNA 

<213> Homo aapiena 

<400> 271 

aactgatttg gttggtatag ctg 

<210> 272 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 272 

cagggtccaa cccggacctg a 

<210> 273 

<211> 21 

<212> DNA 

<2i3> Homo sapiens 

<400> 273 

cagggtccaa tccggacctg a 
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<210> 274 

<211> 22 

^2a2> DWA 

<213> Homo sapiens 

<400> 274 

ttgggaggct aaggcaggag aa 

<210> 275 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400s. 27S 

ttgggaggct gaggcaggag aa 

<2ao> 275 
<2H> 15 
<212> DNA 

<2X3> Callus gallus 

<400> 276 
accaggggaa tctcc 

<210> 277 
<211> 15 
<212> DNA 

<213> Gallus gallus 

<400> 277 
accagggaaa tctcc 

<210> 273 
<211> 45 
<212> DNA 

<213> Gallus gallus 
<400> 278 

cgctacccaa caccagggga atctcctggt attgttggaa acttc 

<210> 279 
<211> 15 
<212> PRT 

<213> Homo sapiens 



<400> 279 

Arg Tyr Pro Thr Pro Gly Glu Ala Pro Gly Val Val Gly Asn Phe 
1 s 



S 10 



<210> 280 
<211> 15 
<212> PRT 

<213> Mus mu3CulU3 



<400> 230 



wo 00/55318 



PCT/IBOO/00532 



77 



Arg Tyr Pro Thr Pro Gly Glu Ala Pro Gly Val Val Gly Aen Phe 
1 5 10 15 

<210> 281 

<aii> 15 

<:2%2> PRT 

<213> Gallus gallus 
<400> 281 

Arg Tyr Pro Thr Pro Gly Glu Ser Pro Qly lie Val Gly Asn Phe 
1 5 10 15 



<210> 282 
<211> 15 
<212> PRT 

<213> Gallus gallus 



<400> 282 

Arg Tyr Pro Thr Pro Gly Lys Ser Pro Gly lie Val Gly Asn Phe 
1 5 10 15 

<210> 283 
<211> 45 
<212> DKA 

<213> Gallus gallus 
<400> 283 

cgctacccaa caccagggaa atctcctggt attgttggaa acttc 



<210> 234 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 284 

gcgtcaggga tggggacag 

<210^ 285 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 285 

gcgtcaggga ttggggacag 

<210> 286 
<211> 17 
<212> DNA 

<213> Homo aapiens 
<400> 286 

ccacttcggt ctccatg 



<i210'> 287 
<211> 17 



wo 00/55318 



PCT/IBOO/0OS32 



78 



<212> DNA 

<213> Homo sapiens 



<400> 287 

Gcacttcgat ctccatg 



